SESSION I


MULTI-TASK  DECISION  MAKING 


Chairman:  R.  W.  Allen 
REVIEW OF THE SYMPOSIUM ON MONITORING BEHAVIOR AND SUPERVISORY CONTROL

(BERCHTESGADEN, F.R. GERMANY, MARCH 8-12, 1976)*

by  Thomas  B.  Sheridan 

Massachusetts  Institute  of  Technology 
Cambridge,  Ma.  02139 


SUMMARY 


The motivation for [...] supervisor. The rapid develop[...]
the loop control [...] primary force which is causing this change,
[...] decrease in cost and size, along with associated new and more
sophisticated developments in software and display technology.

To deal with these issues, over 100 participants from fifteen countries
[...] during 8-12 March, 1976, at Berchtesgaden, F.R. Germany, in the
[...]. Thirty-five papers were presented in three categories:
[...] control (aircraft, automobiles and urban mass transit, ships); process
control ([...] manufacture, nuclear reactors); and general models of
monitoring and supervis[...]. [...] discussions involving all the
participants were held [...] the associated problems in implementation, and
required new interdisciplinary and institutional arrangements.


INTRODUCTION: 

THE  NATURE  OF  SUPERVISORY  CONTROL 


Figure 1 characterizes the trend in controlling complex systems. Figure
1a represents control of a modern aircraft, where at the lowest level of
control, and on a short time scale, thrust, pitch, roll and yaw are stabilized
relative to gust disturbances and reference commands coming from guidance
logic. [...] the vehicle's course is controlled relative to [...] and
reference commands from the navigation logic. Finally, [...] automated, in
part, by ground-based as well as on-board computers. At each level in the
multi-loop control hierarchy the

* Dr. Gunnar Johannsen of Forschungsinstitut für Anthropotechnik, Meckenheim,
Germany, was Co-director along with the author. The meeting was sponsored by
the Special Programmes Panel on Human Factors, Scientific Affairs [...] and
by the Federal Republic of Germany. The papers and workshop reports will soon
be available from Plenum Press in a book entitled Monitoring Behavior and
Supervisory Control.
Figure 1a. Supervisory Control of [...]


human operator can make inputs. At the lowest level he can provide continuous
stick-pedal commands to the inner loop (and does, of course, during
critical take-off and landing maneuvers). At the middle level he can provide
intermittent course changes by resetting course relative to a computer-
navigation base. At the highest level he can modify the flight plan or mission
in major ways, in conjunction with persons on the ground and/or other
aircraft. At each higher level the nature of his control is different,
responses becoming more intermittent (longer time between responses)
and involving more monitoring and more interaction with computers and with
other persons.

Actually it was probably in spacecraft where multi-level control
first was implemented to any degree of sophistication and where one might
say we first saw "supervisory control". It is rather arbitrary to set the
number of levels in the control hierarchy at three (except that the
conjunction of the three words "navigation", "guidance" and "stabilization" has
come into common use in the aerospace industry). In the Apollo spacecraft
one can list many more levels of control for some functions, especially
if one considers nested control loops in computer software, layers of backup
or abort modes, etc.

Turning to a different kind of system, the chemical plant, one finds
a similar hierarchy or nesting of control loops, with successively longer
time constants, successively less continuous real-time automatic control, and
successively more human interaction with higher level control. Figure 1b
suggests three levels of control associated with: "valve control" to regulate
instant-by-instant temperature and pressure; "mixing logic" to effect
longer term mixing procedures; and "plant management" to control the major
steps of producing the given product.

A third example (Figure 1c) is found with the industrial robot, as used
for manipulating and assembling discrete parts on the production line. At the
lowest level conventional servomechanisms control instantaneous forces and
positions of the actuators for each degree of freedom. At a middle level
a computer decides what sequences of positions and forces for the component
degrees of freedom to command in order to achieve certain elementary
manipulations ("therbligs") which are programmed from above, like: reach, grasp,
insert, release, etc. At the highest level the control system coordinates
the robot's accomplishment of various tasks in conjunction with the production
line, special parts feeders, other robots or human workers, etc. Again, most
of the human effort is in monitoring and supervising at the highest control
level, though in case of emergencies or for maintenance or repair the human
operator may intervene in automatic loops and apply direct control at
lower levels.

Figure  2  illustrates  supervisory  control  from  the  viewpoint  of  the 
human  operator.  He  observes  displays  which  may  present  pictorial,  graphic, 
or alphanumeric information. In the near future he may also listen to
computer-generated speech information. He operates hand controls which may
be of the symbolic type (keyboard) or analogic type (joystick, light pen).
In the near future he may speak in specialized code and be recognized by
computer.

The characterization in Figure 2 shows a "local" control loop in which
the human operator can test his plans and programs before committing them
to action. This is analogous to the way the chief executive uses planning
staff in the managerial hierarchy of any large organization. After both plans
and communications (teaching or programming in this case) have been worked
out, the chief executive orders his line organization into action (via the
supervisory link in Figure 2). He then monitors what happens as his
subordinates (in this case the "remote" control loop, with its own sensors,
delegated decision-making powers, and capability to effect its own actions)
take over, reassuming control in emergency or when his instructions have
been executed.


EMPHASES  AND  POINTS  OF  CONSENSUS 


The following, in the opinion of the author, constitute the major
highlights and emphases of the papers presented at the Symposium, and points of
consensus which emerged from the workshops. The reader is referred to the
full papers and workshop reports, forthcoming from Plenum Press.

Multiplicity  of  Interfaces 

Understanding  monitoring  behavior  and  supervisory  control  requires 
observation  and  analysis  at  a  multiplicity  of  system  interfaces.  Figure  3 
(after E. Edwards of the University of Loughborough, England) is a convenient
way of portraying these interfaces: the conventional man-machine
interface  between  a  human  operator  ("liveware")  and  the  hardware;  a  linguistic 
interface  between  him  and  the  software;  a  physiological  interface  between 
him  and  the  environment;  and  a  social  interface  between  him  and  other  persons 
in  the  system.  Edwards  called  this  the  "SHELL"  model  for  obvious  reasons. 


Use  of  Simulators 

The  papers  and  workshops  affirmed  the  increasing  use  of  simulators  in 
complex man and computer-controlled systems. A first reason is that, because
of the rich electronic intercommunication of component subsystems in
simulators and the availability of computers intrinsic to the system, it is
relatively  easy  to  do  measurements,  data  storage,  data  correlation,  and  even 
on-line  modeling  of  human  behavior.  A  second  reason  for  simulation  is  that 
it  is  the  best  way  (perhaps  the  only  viable  way)  to  provide  human  operators 
exposure  to  low  probability  events  -  which  is  one  of  the  major  reasons  that 
human  operators  are  included  in  such  systems. 


Workload 

There was a great deal of discussion about human operator workload. Four
alternative definitions of workload were evident. A first definition is in
terms of what the task demands are for "satisfactory performance" (required
reduction in uncertainty, positioning speed or accuracy, forces required,
etc.). A second is in terms of the level of effort or difficulty as
perceived (subjectively) by the operator. A third is in terms of what the
operator actually accomplishes, or some objective measure of his decision
[...] or force. A fourth is in terms of risk or actual damage to his health
or future capabilities. The common technique of defining workload as the
negative of "spare capacity" (objective performance on a "side task") would
fit the third category. The greatest problems in workload analysis seemed to
be in finding operational definitions which are common across a variety of
tasks.


Internal  Models  of  External  Systems 

Keen interest in "internal modeling" was evident. The internal model
is as old as the ancients: the notion is that somehow the brain has an
internal representation of structures in the external world - of which it
can ask "what would happen if ..." and get reasonable answers. In more
recent years "plant-model" controllers and Kalman filters have become useful
in control, and human operator modelers have hypothesized that people
make use of such internal models too - at least that is a convenient normative
base from which to model. For example, the well-known "Kleinman-Baron-Levison
optimal control model" of the human operator includes a Kalman predictor
as an internal model, and so do recent failure-detection models of Curry and
Gai.

When considering man-computer cooperation in supervisory control systems,
the problem arises whether the human operator needs an internal model not
only of the process or vehicle to be controlled but also of the
decision-making processes which involve the computer serving as a lower level
controller. Such a consideration elicited further concern by some Symposium
participants that the computer needs not only an internal model of the
process to do its control job, but must also have some model of what the
human is likely or able to do in order that it can decide when to relinquish
control to the human or take back control. More is said of this man-computer
responsibility problem later.


Representation  of  Goals  and  Performance  Criteria 

Closely associated with the "internal models" discussions were those
of performance criteria, objective functions, cost functions, payoff functions,
tradeoff functions (and other terms, all of which mean the specification of
what is good and what is bad for a particular system).

It  is  not  easy  for  the  experimenter  to  communicate  to  an  experimental 
subject  a  precise  performance  criterion  or  objective  function,  nor  can  a 
human  operator  in  an  actual  system  explain  precisely  what  function  of 
relevant variables he is seeking to maximize. Attempts to communicate such
functional  relationships  in  mathematical  form  almost  always  fail  because 
most  people  simply  don’t  understand  their  own  behavior  in  such  terms.  Thus 
the  experimenter  or  the  analyst  of  actual  system  behavior  must  infer  the 
performance  criterion  actually  being  used  -  either  from  observation  of  actual 
system  behavior  or  from  a  battery  of  subjective  judgments  which  the  subject 
says  he  would  make  if  confronted  with  given  particular  situations. 

If inference of performance criterion is made from subjective judgments,
there are several techniques which are under active development. One of these
is "multidimensional scaling", a technique developed at Bell Telephone
Laboratories and based upon subjective judgment of dissimilarity between
multi-dimensional stimuli. [...] A second technique is "multi-attribute
utility [...]", developed by Raiffa, Keeney [...] and others and based on
indifference judgments of utility (or worth) with respect to lotteries. It
yields multi-dimensional [...]. A third technique called "interpretive
structural modeling" was developed by Warfield and his colleagues at Battelle
Institute, and is appropriate where transitive orderings can be made among
stimuli. A [...] of direct worth assess[...]. All of these techniques have
interactive computer procedures to accompany them.

Tradeoffs  in  System  Strategy 

The Symposium also dealt with what might be called "system strategy
tradeoffs", questions of management philosophy regarding overall strategy
[...]. [...] tradeoffs are most important. The first is the question
of "[...]". Should the human operator always have the upper hand, [...]
should [...] be designed to endow the computer with the capability to
overrule erratic and detrimental decisions of the operator? To what
extent should man and computer [...] relative ignorance of [...], [...]
is it preferable that [...] harmony, each being continually updated
[...] and intending? These questions were in no way [...]. But the need
for further research was evident.

A second "strategy tradeoff" is between acquiring enough data to be
[...] anticipation. This is a somewhat older [...] of cybernetics, and
does have a quantitative solution provided the event states are well
defined, and all the appropriate prior and contingent probabilities of the
events are known, as well as the costs/benefits of possible outcomes.


Tradeoffs  in  Modeling 

Questions of modeling philosophy were of particular interest because, as
[...] complex and the role of the human operator becomes more
supervisory and [...] less amenable to definition, modeling becomes more
difficult. The questions might, again, be posed in terms of trading relations:

a) Mathematical models of man-machine systems are complex and
incomprehensible [...] with, and lately are becoming more so. Yet
the more traditional verbal model of classical behavioral and social science
[...] as particularly useful in doing an engineering job. Is there a
practical compromise between mathematical complexity and verbal ambiguity?

b) [...] experiments require a constraint and distortion of reality.
[...]

c) [...] enable human operators to behave and be modeled as [...]

d) As [...] engineering areas [...] have been efforts to standardize
certain models, definitions, measures, etc. across the professional
community. [...] standardization be introduced in this field, is it
impossible or premature, [...] should some standardization efforts be pursued?

e) Man-machine [...] criticized as being "mechanistic", and therefore
presumably out of date, especially as we move toward supervisory control.
The opposite might be a [...] to characterize the human supervisor. Can
modeling in [...] terms be useful, and what compromises are appropriate?

Ethical  Implications 

There is much talk these days of making individuals, companies, and
nations more productive. [...] "productivity" is considered in functional
terms [...] etc. At the Symposium there was great concern, [...] among the
Scandinavian delegates, about productivity [...] and whether work is
meaningful. What changes is the computer making with respect to job
satisfaction, [...] the human [...] less and less direct effect on the
product, is more and [...], has less and less understanding of the production
process?

The [...] in this problem area seemed a bit more significant than I
remember them to have been in previous meetings.
HUMAN CONTROL AND MONITORING - MODELS AND EXPERIMENTS
By P.H. Wewerinke

National Aerospace Laboratory (NLR)

SUMMARY


This paper deals with the results of a theoretical and experimental
program concerning human monitoring behavior. Apart from monitoring an
automatic approach, combined monitoring and manual flight director control was
[...] determine the interference between subtasks. Also simultaneous
monitoring and auditory tracking was included.

[...] demonstrate that the multivariable monitor model adequately
describes human behavior in the aforementioned tasks. Furthermore, a
multivariable workload model is developed. Computed workload is shown to agree
excellently with subjective ratings.


INTRODUCTION 


[...] for describing these manned vehicle systems. Apart from
controlling the system (to which situation considerable modeling effort has been
devoted) the pilot often has to fulfil other crucial functions such as
[...] the automatic [...], etc. Insight in this higher mental functioning
is still rather incomplete, [...] several attempts have been made to
investigate and model signal- and failure detection behavior.

The object of this study was to describe human monitoring behavior and
to determine how it is affected by performing other tasks (interference). For
this, a model for monitoring multivariable systems has been developed which
can be considered as an extension of the research of Levison et al (Ref. [...]).
Essentially it is a cascade combination of the subjective expected utility
model (Refs. 2 and 3) and the state estimation submodel of the optimal control
model.

The model has been tested against experimental results of a fixed base
simulator program, dealing with monitoring the automatic approach of a DC-8.
Also combined monitoring and manual flight director control ([...]
control model) was [...] interference. For the same reason the monitor
tasks were combined with an auditory tracking task.

In section 2 the human operator [...]. Section 3 contains a discussion
of the predicted and [...]. [...] section 4 pilot workload is analysed
[...] model is developed in accordance with the task [...] model
suggested in reference [...].


HUMAN OPERATOR MODELS


This section summarizes [...]. The models deal with human control-,
monitor- and decision making behavior as well as [...].

Human control behavior [...] optimal control model (OCM) developed by
Kleinman and Baron (Ref. [...]). This model consists of a perceptual
model indicating how the display[...] related to the "perceived" [...];
[...] model describes how the internal representation of the task
environment results in the actual control input(s).

Human decision making behavior is described by the perceptual model,
however now in a cascade [...] reflecting how "perceived" information results
[...] optimal (binary) decision strategy, [...] will be briefly discussed
in the next chapter.

Interference between several tasks is accounted for by the task
interference model developed [...] (Ref. 5). This model plays a major role in
the present study not only to predict the [...] control- and decision making
tasks but [...] interference and total workload involved in performing the
tasks. [...] reason the task interference model is briefly discussed [...].

[...] operator workload is expressed in terms [...] corresponding with
each su[...] model in order to predict human operator workload corresponding
with multivariable control- and decision making tasks.
Decision making model


Perceptual  model 


Decision making is assumed to be based on the internal representation
of the state of the world, $\hat{x}$. This internal model of the system state,
$\hat{x}$, is based on the "perceived" variables, $y_p$, according to

$$ y_p(t) = y(t-\tau) + v_y(t-\tau) \tag{1} $$

where $y$ represents the "displayed" variables, $\tau$ is a lumped "equivalent
perceptual" time delay, and $v_y$ is a vector of independent, white, Gaussian
observation noises. The autocovariance of each noise component appears to
vary proportionally with mean-squared signal level and may be represented as

$$ V_{y_i}(t) = \pi P_i \, E\{y_i^2(t)\} \tag{2} $$

where $P_i$ is the "noise/signal" ratio corresponding with the fractional
attention paid to variable $y_i$ (see chapter 2.a).

Denoting the covariance of the difference between the instantaneous
value of the state vector and the estimate of it ($e(t) = x(t) - \hat{x}(t)$) by
$\Sigma$, the pair ($\hat{x}$, $\Sigma$) constitutes a sufficient statistic to
test hypotheses about $x$, based on the data $y_p$.
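As an illustration of eq. (2), the following Python sketch computes the observation noise level for one displayed variable from a sampled record. It assumes eq. (2) reads V = pi * P * E{y^2}; the function names `observation_noise_level` and `ratio_from_db`, the use of a sample mean as the estimate of E{y^2}, and the decibel conversion are illustrative assumptions and do not come from the paper.

```python
import math


def observation_noise_level(samples, noise_signal_ratio):
    """Return V = pi * P * E{y^2} for one displayed variable.

    `samples` is a sampled record of the displayed variable y_i and
    `noise_signal_ratio` is P_i. E{y^2} is estimated here by the sample
    mean square, which is an assumption of this sketch.
    """
    if not samples:
        raise ValueError("need at least one sample")
    if noise_signal_ratio < 0:
        raise ValueError("noise/signal ratio must be non-negative")
    mean_square = sum(y * y for y in samples) / len(samples)
    return math.pi * noise_signal_ratio * mean_square


def ratio_from_db(db):
    """Convert a noise/signal ratio quoted in dB to a plain power ratio."""
    return 10.0 ** (db / 10.0)


if __name__ == "__main__":
    record = [1.0, -1.0, 1.0, -1.0]  # hypothetical display record, mean square 1
    print(observation_noise_level(record, ratio_from_db(-20.0)))
```

Because the noise level scales with the mean-squared signal, doubling the signal amplitude in this sketch quadruples V while the ratio P stays fixed.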

Subjective  expected  utility  model 


It is assumed that the human's decision strategy is reflected by the
following stages (Refs. 2-4):

. formulate (N) possible hypotheses $H_j$,

. assess probabilities of all hypotheses based on the available
  information $y_p$, $P(H_j/y_p)$,

. determine (M) possible decisions $D_i$,

. assign the utilities to each hypothesis/decision combination, $U_{ij}$,

. determine the maximum utility-decision $D^*$ according to $D^* = D_i$
  for $E = E_{max}$, where

$$ E\{U/D_i\} = \sum_j U_{ij} \, P(H_j/y_p) \tag{3} $$
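The stages of the subjective expected utility model can be sketched in a few lines of Python. This is a minimal illustration, assuming eq. (3) is the sum over hypotheses of utility times posterior probability; the names `expected_utilities` and `best_decision` and the numerical utilities are hypothetical and are not taken from the paper.

```python
def expected_utilities(utilities, posteriors):
    """Return E{U/D_i} for every decision D_i.

    utilities[i][j] is U_ij, the utility of decision D_i when hypothesis
    H_j is true; posteriors[j] is P(H_j / y_p).
    """
    if abs(sum(posteriors) - 1.0) > 1e-9:
        raise ValueError("posterior probabilities must sum to one")
    for row in utilities:
        if len(row) != len(posteriors):
            raise ValueError("each utility row needs one entry per hypothesis")
    return [sum(u * p for u, p in zip(row, posteriors)) for row in utilities]


def best_decision(utilities, posteriors):
    """Return the index i of the decision with maximum expected utility."""
    values = expected_utilities(utilities, posteriors)
    return max(range(len(values)), key=values.__getitem__)


if __name__ == "__main__":
    # Hypothetical utilities: rows are decisions D_0, D_1; columns are H_0, H_1.
    u = [[1.0, -1.5],
         [-1.0, 1.0]]
    print(best_decision(u, [0.7, 0.3]))    # strong evidence for H_0
    print(best_decision(u, [0.55, 0.45]))  # weak evidence for H_0
```

With these illustrative utilities a wrong D_0 costs more than a wrong D_1, so a posterior only slightly in favor of H_0 is not enough to select D_0.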
Many decision making situations involve binary decisions (the choice
between two hypotheses, e.g., the probability of a successful landing is large
enough or not). Both the following analysis and experimental program deal with
this class of decision tasks.

Now, let the mutually exclusive hypotheses $H_0$ and $H_1$ correspond to the
[...] $x \in R$ and $x \notin R$, respectively, where $R$ is defined as some
region (or "window") in the state space. Using eq. (3) the human's decision is
therefore given by

$$ D = D_1 \quad \text{if} \quad \frac{P(H_1/y_p)}{P(H_0/y_p)} > \frac{U_{00}-U_{10}}{U_{11}-U_{01}} = \frac{U_0}{U_1}; \qquad D = D_0 \quad \text{otherwise} \tag{4} $$

In the following the target region $R$ is given by a multidimensional
"window"; then for $H_0$ can be written

$$ H_0: \; x_{TL_i} < x_i < x_{TU_i} \quad \text{for all } i, \; i = 1, \ldots, n \tag{5} $$


where $x_{TL}$ and $x_{TU}$ represent the lower and upper target boundary,
respectively. Computation of eq. (4) requires the posterior probability of
hypothesis $H_0$

$$ P(H_0/y_p) = \int_{x_{TL_1}}^{x_{TU_1}} \cdots \int_{x_{TL_n}}^{x_{TU_n}} p(x/y_p) \, dx_1 \ldots dx_n \tag{6} $$


where $p(x/y_p)$ is (assumed to be) Gaussian with mean $\hat{x}$ (maximum
likelihood estimate) and covariance $\Sigma$. For the computation of various
measures of decision performance it is convenient to define the human's
decision space DS using eq. (4)

$$ \frac{P(H_1/y_p)}{P(H_0/y_p)} = \frac{U_0}{U_1} \tag{7} $$

or

$$ P(H_0/y_p) = \frac{U_1}{U_0+U_1} \tag{8} $$

Combining eqs. (6) and (8) yields
$$ \int_{x_{TL_1}}^{x_{TU_1}} \cdots \int_{x_{TL_n}}^{x_{TU_n}} p(\mu, \Sigma) \, dx_1 \ldots dx_n = \frac{U_1}{U_0+U_1} \tag{9} $$

where $p(\mu, \Sigma)$ represents the normal probability density function with
mean $\mu$ (representing the elements of DS) and covariance $\Sigma$. Now the
probability that decision $D_0$ is made can be computed according to

$$ P(D_0) = P(\hat{x} \in DS) \tag{10} $$

The probability of wrongly deciding $D_0$ is given by

$$ P(H_1, D_0) = P(x \notin R, \; \hat{x} \in DS) \tag{11} $$

Alternatively, the probability of wrongly deciding $D_1$ can be computed
according to

$$ P(H_0, D_1) = P(x \in R, \; \hat{x} \notin DS) \tag{12} $$

Foregoing theoretical decision making measures will be compared with the
corresponding measures obtained in the experimental program discussed in
section [...].
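The posterior probability of the "window" hypothesis in eq. (6) has a closed form when the components of the state estimate are treated as independent. The Python sketch below makes that simplifying assumption (a diagonal covariance, which the paper does not state) and uses hypothetical names and numbers; a full covariance would call for numerical integration instead.

```python
import math


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def window_probability(means, sigmas, lower, upper):
    """Return P(H_0 / y_p): probability that every x_i lies in its window.

    means[i] and sigmas[i] are the estimate and standard deviation of x_i;
    lower[i] and upper[i] are the target boundaries x_TL_i and x_TU_i.
    Independence between components is an assumption of this sketch.
    """
    prob = 1.0
    for m, s, lo, hi in zip(means, sigmas, lower, upper):
        if s <= 0:
            raise ValueError("standard deviations must be positive")
        prob *= normal_cdf((hi - m) / s) - normal_cdf((lo - m) / s)
    return prob


if __name__ == "__main__":
    # Two hypothetical indicators, both centered in a +/- 1.96 sigma window.
    p_h0 = window_probability([0.0, 0.0], [1.0, 1.0],
                              [-1.96, -1.96], [1.96, 1.96])
    threshold = 1.5 / (1.0 + 1.5)  # U_1/(U_0+U_1) for U_1/U_0 = 1.5
    print(p_h0, "decide in" if p_h0 > threshold else "decide out")
```

Comparing the returned probability with U_1/(U_0+U_1) reproduces the decision rule, and adding indicators shrinks the probability that all of them are inside their windows at once.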

Task  interference  model 

Task interference is modeled in terms of the following relationship
[...] fraction of attention, $f_i$, paid to subtask (indicator) $i$, and the
corresponding human's internal noise/signal ratio, $P_i$:

$$ P_i = P_{0_i} / f_i \tag{13} $$

where $P_{0_i}$ is the ratio corresponding to single-task performance ("full
attention"). Furthermore, it is assumed that the amount of
information-processing capacity is determined by the demand of the subtasks and
not by the amount of subtasks to perform. In formula

$$ \sum_i f_i = 1 $$

This model, developed by Levison et al, has been partly validated for some
multivariable control situations (Ref. [...]) and dual decision making tasks
(Ref. 4). In the present study the model will be tested in multivariable
hypotheses-situations, and in combined control and decision making tasks
(both interacting and non-interacting).
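The task interference relationship can be illustrated numerically. The Python sketch below applies eq. (13) under the constraint that the attention fractions sum to one; the function names and the example ratios are hypothetical and are chosen only to show the scaling.

```python
import math


def shared_noise_ratios(full_attention_ratios, fractions):
    """Return P_i = P_0i / f_i for every subtask.

    full_attention_ratios[i] is P_0i, the noise/signal ratio measured for
    subtask i alone; fractions[i] is f_i, the fraction of attention paid
    to it. The fractions must be positive and sum to one.
    """
    if len(full_attention_ratios) != len(fractions):
        raise ValueError("one attention fraction per subtask is required")
    if any(f <= 0 for f in fractions):
        raise ValueError("attention fractions must be positive")
    if abs(sum(fractions) - 1.0) > 1e-9:
        raise ValueError("attention fractions must sum to one")
    return [p0 / f for p0, f in zip(full_attention_ratios, fractions)]


def db_increase(fraction):
    """Increase of the noise/signal ratio in dB when attention drops to f."""
    return -10.0 * math.log10(fraction)


if __name__ == "__main__":
    # Two hypothetical subtasks sharing attention equally.
    print(shared_noise_ratios([0.01, 0.01], [0.5, 0.5]))
    print(round(db_increase(0.5), 2))
```

Halving the attention paid to a subtask doubles its noise/signal ratio, which corresponds to an increase of about 3 dB.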

In reference 5 the concept of fractional attention [...] model
[...] will be presented and [...] with experimental data (subjective
ratings). The model will be discussed in section 4, because it has not been
used to predict the experimental results.


EXPERIMENTS 


Experimental  set-up 

In order to investigate monitoring behavior for various task situations,
the following single-task configurations were examined in the experimental
program (figure 1)

. monitoring the fast/slow- and the glideslope indicator during
  an automatic approach of a DC-8 (indicated by M2).

. monitoring the fast/slow-, the glideslope- and the localizer
  indicator (M3).

. manual (flight director) approach (only longitudinally) of a DC-8
  (C).

. auditory presented unstable first order tracking task with a time
  constant of 1 sec (A).

Also combinations [...] tasks were included to validate the
aforementioned task interference [...] both control and monitoring tasks.
Combining the longitudinal control task and the two monitoring tasks results
in [...] other way the prior probabilities of the monitor tasks depend on
[...] control behavior. The combinations of [...] task were included to
study the [...]. [...] combined task configurations are M2C, M3C, M2A and
M3A.


Single-task  configurations 

[...] the decision making tasks were intended to represent [...] of
deciding whether [...] be within the "landing window" of acceptable
deviations. The subject depressed [...] response button whenever he decided
hypothesis $H_1$ to be true (one or more indicators outside their region of
acceptance).

The longitudinal control task was a manual approach tracking task
(without the time varying aspect involved in an actual approach and landing)
[...] flight director (FD in figure 1). The flight director design,
autopilot characteristics and turbulence levels are derived from reference 6.

[...] characteristics were such that the perceived [...] was included,
so the [...] by [...] isometric side-arm controller.


Measures,  subjects  and  procedures 


[...] the decision making tasks the measures discussed in section 2 were
[...]. Tracking performance was measured in terms of the relevant
variance-scores ([...] deviations, flight director deviations, speed
deviations, stick [...] deviations, etc.). Furthermore, pilot workload was
[...] on the scales presented in table 1. [...] of the dual task
configurations, an overall impression of the total task difficulty was given.


[...] subject [...] and the order [...] combined control- and decision
making tasks subjects were instructed [...] the control task as well as
possible and to spend their "reserve capacity" on the decision making task.


Theoretical  and  experimental  results 


Since space does not permit an extensive presentation of the experimental
results*, only the principal results will be discussed corresponding with the
model predictions.


* These will be contained in a later report.
Single-indicator tasks


In order to obtain noise/signal ratios corresponding to monitoring single
display indicators ("full attention") a base-line experiment was conducted
to "calibrate" the displays. Three subjects performed each of the three
single-indicator decision making tasks two times. The measured decision making
scores are shown in table 2. Two model parameters were varied to match the
corresponding scores: the observation noise level associated with the displayed
variable and the utility ratio $U_1/U_0$. The perceptual time delay, $\tau$, was
kept constant (0.20 sec). The resulting model scores are also given in table 2.
Although the subjects were instructed to weigh "miss"-errors and "false
alarm"-errors equally and to minimize the total decision error, $P_e$, they were
apparently somewhat reluctant in deciding "in" ($U_1/U_0 = 1.75$).
Based on this result $U_1/U_0 = 1.5$ was used for the subsequent model
predictions. The resulting observation noise levels reflect display phenomena
and/or "indifference" thresholds. These values were assumed to be the ratios
corresponding to "full attention".

Using eq. (13) the actual noise/signal ratio could be determined given
the fraction of attention, [...], paid to indicator $i$ for the multivariable
decision making (and control) tasks.


Multivariable  decision  tasks 


Decision scores were predicted for the M2- and M3-tasks assuming an ...

... measured and predicted scores (Table 3) reveals an excellent agreement
between all corresponding scores. The effect of the ratio $U_1/U_0$ on the
decision making scores is shown in figure 2. As expected, the probability of
deciding "in" as well as the joint probabilities are relatively sensitive to
this ratio; however, the total decision error, $P_e$, increases only slightly
when $U_1/U_0$ varies from 1 to 1.5.

Decision  making  and  non-interacting  control 


Next  the  combined  auditory  tracking  and  decision  making  tasks  are 
considered.  The  following  procedure  was  used. 


*  Observation noise levels corresponding with indicator position are
modified because of the non-zero references (Ref. *0 .

** The assumed value of 1.5 turned out to be rather close to the actually
"measured" ratio. Pilots' comments clarified why this ratio was larger
than one: after being out of the "window" they wanted to be sure before
deciding "in" again.
The noise ratios of the single audio task were determined by matching the
model scores to the measured ones using figure 3; these values correspond to
full attention. The same was done for the audio tasks when performed in
combination with the decision tasks. The corresponding fractions of attention
($f_A$) were used to predict the attention paid to the decision tasks ($1 - f_A$).
The resulting decision scores are given in table 3. Comparing these scores
with the measured errors shows that the model predicts a larger interference
(the predicted errors are too large for both M2(A) and M3(A)).

Matching the measured scores using figures 4 and 5, containing the
decision error versus fraction of attention, results in
the actual fraction of attention paid to M2(A) of 0.7 and to M3(A) of 0.73.

The explanation for the smaller interference is, of course, the fact
that the decision task involves visual information and that the control task
involves aural information: the interference between visual and
aural information is less than between two visual tasks. It is particularly
noteworthy that the interference for the combined M2(A) task is
similar to the results for the M3(A) task: the total amount of information
processing capacity is 1.2 for both configurations. Based on these results
it is concluded that the task interference model has to be adjusted for
these tasks. For combined visual and auditory tasks the total
amount of information processing capacity is tentatively hypothesized to be
1.2 (instead of 1 (2) for full (no) interference).


Decision  making  and  interacting  control 


The third set of configurations was included to determine the extent
to which the task interference model would hold for simultaneous control and
decision making when the control behavior affects the (a priori) decision making
situation. For this reason the aforementioned split-axis manual approach and
monitoring situation was selected. The assumption
was that control behavior was based on the perception of the flight
director and decision making on the basis of the perceived monitor indicators
(u, d and y). In other words, it was assumed that two separate internal
models were used.

The same procedure was followed as before. The single control task (C)
was modeled by matching the model scores to the measured ones. The resulting
score (in table 4) is obtained for a noise ratio of - 1o*T dB.

By changing only the observation noise ratio, the control tasks performed
in combination with the decision tasks (C(M2) and C(M3)) were matched. The
result is also contained in table 4. The corresponding fractions of attention
were used to predict the decision scores. The predicted errors agree with the
measured errors for the M3(C) task; however, the predicted errors for the
M2(C) task are considerably larger (20 %) than the measured errors. For this
reason the measured decision scores were matched using figure 7. The resulting
scores (given in table 3) correspond with a fraction of attention which is
about 0.1 smaller than the predicted fraction (based on the interference model).

It  is  concluded  from  the  foregoing  results  that  the  total  information 
processing  capacity  is  relatively  constant  (1.0  for  the  M3(C)  task  and  0.9 
for  the  M2(C)  task).  However,  an  interesting  refinement  to  the  model  will  be 
discussed  in  the  next  section  incorporating  the  aspect  of  pilot  workload. 


PILOT  WORKLOAD 


In accordance with the task interference model and the fractional workload
model suggested in reference 5, a multi-task workload model is proposed
in this section and compared with measured workload (subjective ratings).

Assuming the workload corresponding with performing a single task (indicator)
is $W_{i0}$, the fractional workload when performing this task in combination
with other tasks is

$$W_i = W_{i0} \, F(f_i) \qquad (15)$$

where $F(f_i)$ represents the functional relationship between the fraction of
attention dedicated to the subtask i and the corresponding fractional workload.
The total workload involved in performing M tasks is given by

$$W = \sum_{i=1}^{M} W_i \qquad (16)$$

implicitly assuming that the function F is such that $W_i$ is expressed in units
along an interval scale.

Subjective ratings were used which were obtained on the adjectival
"demand-scale" shown in table 1 (Ref. 8). This scale is assumed to be an
interval scale. Assuming a logarithmic relationship between $f_i$ and $W_i$
results in $F(f_i) = \log(c f_i)$, where c is an empirical constant
corresponding to the zero of the interval scale. A value of c = 10.5 results
in computed total workload values surprisingly close to the measured workload
ratings. This is shown in table 5 and plotted in figure 8.
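
The workload model of eqs. (15) and (16) can be illustrated with a short Python sketch. It is only a sketch under stated assumptions: the base of the logarithm is not given in the text and is taken here to be 10 (with c = 10.5 this gives F(1) close to 1, so a task receiving full attention carries roughly its single-task workload), and the function names and the numerical inputs in the usage note are hypothetical.

```python
import math


def fractional_workload(w_single, fraction, c=10.5):
    """Eq. (15): W_i = W_i0 * F(f_i), with F(f) = log(c * f).

    Assumption: the logarithm is base 10; the text only says "Log".
    """
    if fraction <= 0.0:
        raise ValueError("fraction of attention must be positive")
    return w_single * math.log10(c * fraction)


def total_workload(w_singles, fractions, c=10.5):
    """Eq. (16): total workload is the sum of the fractional workloads."""
    if len(w_singles) != len(fractions):
        raise ValueError("one fraction of attention is needed per task")
    return sum(
        fractional_workload(w, f, c) for w, f in zip(w_singles, fractions)
    )
```

For example, `total_workload([3.0, 3.0], [0.5, 0.5])` evaluates two tasks that share attention equally; the single-task rating of 3.0 is an illustrative value, not one taken from table 5.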

Returning to configuration M2(C) one can see that the computed workload
based on the predicted fraction of attention on the monitor task is larger
than the measured workload. This is in accordance with the predicted
performance which was superior to the measured decision error. Using the
"matched" fraction of attention results in a predicted workload very close to
the value actually measured (see table 5 and figure 8). So, the only question
is why the predicted performance-workload trade-off differs from the measured
one.


*  During the single-indicator experiment no workload data were obtained.
Therefore, $W_{10}$, etc. were assumed to be equal and estimated by matching
the M2 configuration. The result was a rating of 3.1*

In figure 9 the theoretical curves of decision error versus fraction of
attention are shown for both M2(C) and M3(C). For the matched M2(C) task the
performance versus workload trade-off is $\Delta P_e / \Delta W = 0.12$. This is
close to the value for the M3(C) configuration (0.11). So, the suggestion is
that the subjects were not motivated enough to spend their "full" capacity
because the pay-off was insufficient. More experimentation will be needed to
establish this capacity/pay-off relationship. However, this can be considered
as a (useful) refinement to the models, which have been shown to describe
encouragingly well the control and decision making tasks studied in this
program, both in terms of performance and workload.
TABLE 1: RATING SCALES


Name:

Task:

Using the scale below, indicate the degree of effort you spend on
performing the task. Items are listed in order of increasing effort.


Rating Scale for Controllability/Observability and Precision

    Extremely easy to control/monitor with excellent precision
    Very easy to control/monitor with good precision
    Easy to control/monitor with fair precision
    Controllable/observable with somewhat inadequate precision
    Controllable/observable, but only very imprecisely
    Difficult to control/monitor
    Very difficult to control/monitor (8)
    Nearly uncontrollable/unobservable (9)
    Uncontrollable/unobservable
    Not applicable


Rating Scale for Demands on Pilot

    Completely undemanding, very relaxed and comfortable
    Largely undemanding, relaxed
    Mildly demanding of pilot attention, skill, or effort
    Demanding of pilot attention, skill, or effort
    Very demanding of pilot attention, skill, or effort
    Completely demanding of pilot attention, skill, or effort
    Nearly uncontrollable/unobservable
    Uncontrollable/unobservable
    Not applicable
AN EXPERIMENTAL SITUATION FOR STUDY OF PILOT INTERACTION

WITH  AUTOMATED  AIRBORNE  DECISION  MAKING  SYSTEMS 

By  William  B.  Rouse,  Yee-Yeen  Chu,  and  Rex  S.  Walden 

University  of  Illinois 
Urbana,  Illinois  61801 


SUMMARY 


This paper describes an experimental situation which allows varying
degrees of automation in a set of control and monitoring tasks. The control
tasks involve flying a map display at specified altitude and speed. The
monitoring tasks involve detection of events in N dynamic processes, each
of which has associated actions which involve
a checklist-like tree search. Computer aid, with adjustable reliability,
is available for both the control and monitoring tasks. A series of
experiments which are to be performed in this experimental situation is
described.


INTRODUCTION 


Several factors are leading to the consideration of automated decision
making systems for aircraft operations. Aircraft are becoming more
sophisticated and complicated while greater precision and performance is
being required of the pilot. The pilot simply does not have the time to
do everything well. At the same time, computers are becoming smaller,
faster, and cheaper. Thus, computer-aided decision making is (or will be)
both desirable and feasible.

It is unlikely that the computer will completely replace
the pilot. In failed or unusual situations, the pilot will be called upon
to perform tasks normally allocated to the computer. Also, the
pilot will serve as an executive or manager providing goals to the computer,
monitoring overall performance, and occasionally preempting inappropriate
decisions by the computer.

The problem area addressed by the project discussed in this paper is
the interaction between a human pilot and a computer with decision making
capability. The goal of the research is to enhance cooperation between
the two decision makers by understanding and then avoiding possible modes
of competition between them.


*  Supported by the National Aeronautics and Space Administration under
NASA-Ames Grant NSG-2119.
To be more specific, we are concerned with multi-task situations
and the issues of primary interest include:

1.  Allocation of responsibility between human and computer.

2.  Resolution of conflicts between human and computer.

3.  Human confidence in the computer system.


Considering allocation of decision making responsibility, we have
proposed that responsibilities not be strictly assigned to each decision
maker [1,2]. Instead, allocation should adapt to the state of the aircraft
and the state of the pilot. Both pilot and computer would have
responsibility for all or most decision making tasks, with responsibility at any
particular instant being assigned to the decision maker most able at that
moment to perform the task. Simple concepts from queueing theory indicate
that such a procedure for allocation of responsibility would improve the
utilization of system (aircraft plus pilot) resources and thereby improve
system performance. This approach would allow the pilot to retain a
coherent role in the sense of having overall responsibility for the whole
aircraft while the computer would enable the pilot to avoid having to
continually exercise all of these responsibilities.
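
The queueing argument can be made concrete with a textbook comparison, sketched below in Python. This is not the model of reference 3; it assumes Poisson event arrivals, exponentially distributed service times, and two equally fast decision makers (whereas the study itself considers a human-computer speed mismatch). With strict assignment, each decision maker behaves as an M/M/1 queue serving its own stream of events. With shared responsibility, the two act as an M/M/2 queue serving the combined stream. The function names are hypothetical.

```python
def mean_time_strict(arrival_rate, service_rate):
    """Mean time an event spends in the system when each decision maker
    serves only its own event stream (M/M/1): W = 1 / (mu - lambda)."""
    if not 0.0 < arrival_rate < service_rate:
        raise ValueError("need 0 < arrival_rate < service_rate")
    return 1.0 / (service_rate - arrival_rate)


def mean_time_shared(arrival_rate, service_rate):
    """Mean time in the system when two identical decision makers share
    the combined stream of rate 2 * lambda (M/M/2):
    W = 1 / (mu * (1 - rho**2)), with rho = lambda / mu."""
    if not 0.0 < arrival_rate < service_rate:
        raise ValueError("need 0 < arrival_rate < service_rate")
    rho = arrival_rate / service_rate
    return 1.0 / (service_rate * (1.0 - rho ** 2))
```

Under these assumptions the shared arrangement is faster by a factor of (1 + rho), which is one way to read the claim that adaptive allocation improves the utilization of system resources.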


The most significant disadvantage of adaptive allocation is the
possibility of conflict between the two decision makers. Without sufficient
information about each other's actions, the pilot and computer might
compete to perform tasks. This would degrade system performance (and
perhaps be disastrous) and possibly cause adaptive allocation to be an
unattractive approach. Theoretical and experimental approaches are being
employed to assess the costs of conflict and devise methods of avoiding
conflicts.


Of course, the success of any adaptive decision making system will
depend on the pilot's confidence in the system. Within our experiments,
we plan to study the pilot's decisions concerning mode of use of the
computer system and his ability to detect when the decision making system has
failed.


THEORETICAL  FORMULATION 


To investigate the feasibility of adaptively allocating decision making
responsibility between human and computer, a mathematical model of
multi-task decision making was developed and several simulation experiments were
performed [3]. The goal of this effort was determination of the effects
of several system variables including number of tasks, mean time between
arrivals of action-evoking events, human-computer speed mismatch,
probability of computer error, probability of human error, and the level of
feedback between human and computer.

The model is based on queueing theory concepts. Multi-task decision
making is described in terms of events and actions. The decision maker's
task is to detect action-evoking events and to decide whether or not any
warrants his full attention. The model assumes the human
to employ a quasi-optimal decision making strategy for scanning displays
... However, the results of our simulation studies ...
because we were most interested in gross effects.

This formulation has given us valuable insights into problems
associated with the design of an adaptive decision making system. However,
before we can use this formulation to evaluate potential designs, a better
understanding of human decision making in multi-task situations is needed.


AN EXPERIMENTAL SITUATION


... performed without substantial changes in the software ...
wanted to be consistent with the simulations
being developed at NASA Ames Research Center ...

Figure 1 illustrates the CRT display observed by the pilot. The
display is a Hewlett Packard 1310 while the computer generating the
display is a PDP-11/40 augmented by special purpose equipment for refreshing
the display. The pilot's task is to fly the Boeing 707 dynamics along
the map at specified altitude and air speed. His flight
instruments are displayed beside the map. The pilot has aileron, elevator,
and thrust controls.


Figure  1:  The  Experimental  Situation 
The instruments below the map represent the numerous aircraft
subsystems (e.g., electrical, hydraulic, etc.) which the pilot monitors for
possible action-evoking events. Each of the subsystem instruments
represents a linear dynamic process driven by non-zero mean white noise input.

An event is defined as the removal of the input, which causes the state
of the process to asymptotically approach zero (pointer down).
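
A subsystem instrument of this kind can be sketched in a few lines of Python. The sketch below is illustrative only: the order of the process and its parameters are not given in the text, so a discrete-time first-order process is assumed, and the function name, the pole `a`, and the noise parameters are hypothetical.

```python
import random


def simulate_subsystem(n_steps, event_step, a=0.9,
                       input_mean=1.0, input_std=0.2, seed=0):
    """Simulate x[k+1] = a * x[k] + u[k] for one subsystem instrument.

    Before event_step, u[k] is white noise with non-zero mean.
    From event_step onward the input is removed (u[k] = 0), so the
    state decays toward zero ("pointer down").
    """
    if not 0.0 < a < 1.0:
        raise ValueError("a must lie strictly between 0 and 1")
    rng = random.Random(seed)
    x = input_mean / (1.0 - a)  # start at the steady-state mean
    history = []
    for k in range(n_steps):
        history.append(x)
        u = rng.gauss(input_mean, input_std) if k < event_step else 0.0
        x = a * x + u
    return history
```

Calling `simulate_subsystem(200, 100)` gives a trace that fluctuates about its mean for the first 100 samples and then decays geometrically, which is the signature the pilot is asked to detect.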

Upon detecting an event to which he wishes to respond, the pilot
selects that subsystem via a 4 x 3 keyboard. The display shown in Figure 2
then appears. This represents the first level of a checklist-like tree
associated with the subsystem of interest. He searches for a branch labeled
with a zero and selects that branch with his keyboard. If the tree has
more levels, the next level is then displayed. After completing the last
level of the tree, the action is complete and the display shown in Figure 1
returns.


Figure  2:  Display  When  Pilot  Has  Reacted  to  an  Action-Evoking  Event 


Within this experimental situation, we can vary the complexity of the
map, the availability of autopilot, the number of subsystems, the
distribution of arrivals of events among tasks, and many other aircraft and
subsystem parameters.


EXPERIMENTS  PLANNED 


Two sets of experiments are planned using the above experimental
situation. The first set will address the question: How is pilot decision
making (in terms of response times, errors, and computer usage) affected
by task complexity, autopilot, and non-adaptive computer aiding?

At this point, the following scenario is envisioned. Subjects will
fly 50 minutes enroute (simple map) and there will be little subsystem
activity (few events). Then, they will fly 10 minutes in a landing
situation (complex map) where there will be considerable subsystem activity.
Experimental measurements will include:


1.  Response time to subsystem events,

2.  Probabilities of error,

a.  False alarm

b.  Missed event

c.  Incorrect action

3.  Course  errors, 

4.  Control  actions. 

For  the  first  experiment,  each  subject  will  perform  three  trials 
where  the  experimental  variable  of  interest  will  be: 


1.  No  autopilot, 

2.  Reliable  autopilot, 

3.  Unreliable  autopilot. 


Reliable autopilot refers to one with a low probability of malfunction
while unreliable refers to one with a high probability of malfunction.

An autopilot malfunction is characterized by the aircraft maintaining its
current bearing and air speed and thereby eventually deviating from the
desired path. Once a malfunction occurs, a subject's task is to take
control of the aircraft, return it to the appropriate course, and re-engage
the autopilot (which never fails irreparably).

The second experiment of this first set of experiments will employ
the same scenario and consider pilot interaction with a decision making
system designed to aid him in detecting subsystem events and acting
appropriately. This system will be non-adaptive and designed on the basis of
results obtained from the first experiment. The computer aid will have
two modes of operation (other than "off"):


1.  Detection  only, 

2.  Detection  and  action. 


It will also have an internal adjustment whereby its probability of
successful decision making can be changed. When initialized this probability
will be high enough to be a significant aid to the pilot. However, if a
malfunction occurs, the probability will decrease to the point of making
the computer aid a hindrance. The subject's task will be to detect this
occurrence and reset the adjustment via a simple pushbutton. The computer
aid will have two levels of reliability which will be characterized in a
manner similar to the autopilot, probably using the same numerics to
standardize subjects' perception of reliability.


The  empirical  results  obtained  from  this  first  set  of  experiments 
will  be  combined  with  the  theoretical  formulation  discussed  above  and 
employed  in  the  design  of  an  adaptive  decision  making  system.  Besides 
the  queueing  theory  formulation,  we  are  also  considering  control  theory 
as  a  methodology  with  which  to  design  an  adaptive  system.  Another  issue 
of  special  importance  to  an  adaptive  system  is  the  measurement  of  both 
aircraft  and  pilot  states.  Several  alternative  approaches  are  being 
considered.  The  adaptive  system  that  results  from  our  investigations  will 
be  experimentally  evaluated  using  the  situation  discussed  in  this  paper. 




CONCLUSIONS 


It appears inevitable that aircraft will become increasingly automated.
However, it also appears unlikely that the pilot will disappear from the
cockpit. Instead the pilot and computer will have to cooperate in
managing the aircraft. This paper has discussed an experimental situation that
is being used to theoretically and experimentally determine how cooperation
between these two decision makers can be enhanced.
AN EXPERIMENTAL SITUATION FOR STUDY OF

HUMAN DECISION MAKING IN MULTI-PROCESS MONITORING*

by William B. Rouse and Joel S. Greenstein

University  of  Illinois 
Urbana,  Illinois  61801 


SUMMARY 


This paper describes an experimental situation which has been developed
to enable study of the human's decision strategy and performance when
monitoring multiple dynamic processes. The decision maker's goal is to detect
dynamic and/or statistical changes in any of the processes and take
appropriate action to return the process to an acceptable mode of behavior. This
situation is somewhat analogous to industrial monitoring and air traffic
control. A series of planned experiments is discussed whose results are to
be employed in the design of a computer-aided process monitoring system where
the computer learns how to detect changes by "watching" the human perform
the task.


INTRODUCTION 


Quite frequently, a human must simultaneously monitor several dynamic
processes. His task is to detect action-evoking events and implement the
appropriate actions. As the number of processes increases and/or as the
frequency of action-evoking events increases, the human becomes overloaded
and overall system performance decreases.

The goal of the project discussed in this paper is the design of a
computer-aided process monitoring system. Three important issues in the
design of such a system are:

1.  Allocation of decision making responsibility,

2.  Resolution of conflicts between decision makers,

3.  Human confidence in the computer system.

Theoretical studies [1,2,3] have led us to suggest that allocation of
responsibility should be dynamic (i.e., situation dependent) with particular
decision making tasks being assigned to the decision maker who, at the
moment, has the time to devote to the task. This approach can yield
significant performance benefits if conflicts between decision makers can be avoided.


*  Supported  by  the  Joint  Services  Electronics  Program  (U.S.  Army,  U.S.  Navy, 
and  U.S.  Air  Force)  under  Contract  DAAB-07-77-O0259. 
Instead of resolving conflicts, it would be preferable to avoid them.

An approach to accomplishing this involves giving the computer a model of
human behavior so that the computer may infer the state of the human within
the overall system. To develop an understanding of human decision making
in the simultaneous monitoring of several dynamic processes, we have designed
an experimental situation. This situation will be employed both in the
development of models of human decision making and for evaluation of
potential systems for computer-aided process monitoring.


AN  EXPERIMENTAL  SITUATION 


Figure 1 illustrates the display observed by the subject. The display
is generated on a Tektronix 4010 display terminal by a time-shared
DEC-System 10 computer and depicts the measured values of the outputs of nine
processes over the past 100 sampling intervals. The subject's task is to
monitor the processes, via the display, for the occurrence of abnormal events.
The processes normally have identical second order system characteristics
with zero-mean Gaussian white noise inputs of identical variance. The
displayed measurements of the process outputs are corrupted by additive
zero-mean Gaussian white noise sequences which normally have identical variance.
Abnormal events might be represented by changes in the dynamics of a process,
changes in the process input, or an increase in measurement noise variance.

After scanning the nine process histories, the subject is given an
opportunity to key in the numbers of processes in which he has decided an
abnormal event has occurred. He also enters his estimate of the time at
which the event occurred. Upon completion of his responses he is given
feedback regarding the actual states of the processes he has keyed in ("1"
indicates the normal state, "0" indicates an abnormal state). The display
is then erased, any abnormal processes detected by the subject are returned
to the normal state, and a new display depicting the process histories
advanced in time is generated as illustrated in Figure 2.

Within this experimental situation we can vary the number and type of
different events, the frequency of occurrence of events, the distribution
of events over processes, and the amount of time the process histories are
advanced each iteration. The format in which the subject reports event
occurrences and receives feedback can also be varied.


EXPERIMENTS PLANNED


Two sets of experiments are planned that will employ the above
experimental situation. The first set will investigate human decision making in
event detection and attention allocation. In the second set of experiments,
computer-aided process monitoring will be studied.


Within the first set of experiments, one experiment is currently
underway while another will soon begin. The first experiment is aimed at
understanding the human's event detection behavior. Subjects are allowed
to respond to as many events as they think have occurred. Their false alarms
are subtracted from their hits and the remainder is divided by a
measure of their delay in responding (event waiting time). The resulting
measure is their score, which they are instructed to maximize.
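
The scoring rule described above can be written as a short Python sketch. The text does not say exactly how the delay measure is computed, so the mean event waiting time is assumed here; the function name and arguments are hypothetical.

```python
def detection_score(hits, false_alarms, waiting_times):
    """Score = (hits - false alarms) / delay measure.

    Assumption: the delay measure is the mean event waiting time;
    the text only calls it "a measure of their delay in responding".
    """
    if not waiting_times:
        raise ValueError("at least one waiting time is required")
    mean_wait = sum(waiting_times) / len(waiting_times)
    if mean_wait <= 0.0:
        raise ValueError("mean waiting time must be positive")
    return (hits - false_alarms) / mean_wait
```

A subject with 5 hits, 1 false alarm, and waiting times of 2 and 4 iterations would score 4/3 under this reading. The rule rewards prompt, accurate responses and penalizes both guessing and hesitation.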

To  model  human  decision  making  in  this  task,  we  are  employing  signal 
detection  theory  and  discriminant  analysis.  Our  goal  is  to  develop  a  model 
that  will  be  of  real  time  use  to  a  computer  as  it  attempts  to  assess  the 
state  of  the  decision  maker. 

The experiment that is soon to begin emphasizes event detection and
attention allocation. In this experiment, subjects will only be allowed to
respond to one event per iteration. Thus, they must trade off uncertainty
and costs to reach a decision. We have developed a queueing theory model
of this task and will compare it to the human's decision making performance
in this experiment.

The second set of experiments will consider computer-aided process
monitoring. We envision that the computer will employ the detection and
attention allocation models noted above to assess the state of the decision
maker. The computer will then be able to adapt to both the state of the
task and the state of the human.


CONCLUSIONS 


Many  human  tasks  can  be  abstracted  as  the  simultaneous  monitoring  of 
multiple  dynamic  processes.  Computer  aiding  may  allow  the  human  to  monitor 
an  increased  number  of  processes  and  also  to  monitor  more  effectively.  The 
research  discussed  in  this  paper  is  aimed  at  designing  and  evaluating  a 
computer-aided  process  monitoring  system  along  the  lines  of  that  proposed 
in  reference  3. 
Figure 2: An Updated Display


THE PSYCHOPHYSICS OF RANDOM PROCESSES

Man-Vehicle Laboratory
Massachusetts  Institute  of  Technology 
Cambridge,  Mass.  02139 

ABSTRACT 


One of the major functions of the human monitor is to detect failures
in systems under observation. For systems which display continuous-time
variables, this is equivalent to determining when the statistical
parameters of the random process have changed. This paper
reports on some preliminary experiments to determine the threshold
values for changes in the variance, bandwidth, and damping ratio of
a second order random process. The detection time appears to be
summarized by the change in the standard deviation of the velocity
of the display.


INTRODUCTION 


Human monitoring and supervisory control has become a research topic of
increasing interest due to the advances in lower cost automation (Sheridan and
Johansen, 1976). One aspect of human monitoring is that of failure detection,
and some results and models have been proposed for modelling the human's ability
to detect changes in the mean of a random process (Gai and Curry, 1976). However,
there are many other parameters of a random process which are subject to change
(Anyakora and Lees, 1972), examples of which are changes in variance,
bandwidth, and damping ratio. In a recent review (Curry and Gai, 1976) we were
unable to uncover data on the ability of the human to detect changes in these
parameters. Thus we performed a set of experiments to determine the
preliminary threshold values for changes in these random process parameters, i.e.,
to determine some of the basic psychophysical constants of random processes.
These data when complete will be valuable in designing more extensive
experiments, and will also be useful in modelling efforts and in obtaining
performance estimates for the human in failure detection tasks.
EXPERIMENT


Overview 


[...] second order shaping filter with transfer function

$$ \frac{K}{(s/\omega_n)^2 + 2(\zeta/\omega_n)s + 1} $$

and zero mean white noise input. [...] displacement of a horizontal line
[...] screen. After the steady state was reached, one of the parameters of
the filter was either increased or decreased (frequency, $\omega_n$; damping
ratio, $\zeta$; or gain, $K$ (for noise power and hence the output variance));
the subject was expected to detect the "failure," or change. All the relevant
parameters and the detection time were recorded for each trial.


Equipment 

A PDP-11/10 [...] digitally and was [...] through a [...]. An
analog computer was then used to smooth the [...] oscilloscope screen. The
smoothing filter [...] shaping filter natural frequency, sufficient [...] to
eliminate any discrete jumps when the high frequency content of the process
was increased. The subject was seated about 2½ feet in front of the
oscilloscope screen. The display was scaled such that the screen [...] 6σ of
the process. Graticule marks provided the reference [...]. The subject had
two switches (for [...]) to indicate his decision when the change was [...].
These switches were read before each update of the output of the process,
every 1/30 of a second.


Procedure 

Sixteen graduate students participated in two sessions lasting
approximately 40 minutes each. Each session consisted of three series of
trials in which one of the three parameters was changed from its nominal
value. Only [...] session; the four [...] T = 1, 3 and ζ = .2, .707.

At the start of the familiarization phase, the normal or nominal mode
was shown to the subject for [...]. After this normal mode presentation,
large failures were shown to familiarize the subject with the nature of the
change. The change occurred randomly between [...] seconds after the start
[...] sufficient to allow the subject [...] with the changes. If the
procedure was not clear at this stage the five initial trials were repeated.
Before the beginning of the experiment the subject was told of the stopping
criterion, i.e., that the objective was to determine the minimum detectable
change in the parameter value. He was instructed to press the switch as
soon as he was certain about his decision. Correct detections were acknowledged
by the printing of a single character on the teletype, which was audible to
the subject. When an error was made, the nature of the error was printed on
the teletype and the experimenter informed the subject about the error. Three
types of error were possible: if a change was 'detected' before one really
occurred, it was labelled a false alarm; if a change was judged as a decrease
while it was actually an increase (or vice versa), or if the change was not
detected within the time limit, it was labelled a missed alarm. During any
trial, the failed mode continued for 30 seconds after the beginning of the
failure; if a decision had not been made by this time, the motion stopped at
zero and a period of 5 seconds was given to decide. If a decision was not made
after this, it was considered a missed alarm. The experiment continued until
he made three or more errors in the most recent six trials.

After each trial, a blanking period of three seconds was given before
starting the next trial. From one trial to the next, the parameter change
could be either an increase or a decrease according to the following relation:

$$ \frac{P}{P_N} = \exp\{\ln R \times S\} $$

P_N - Nominal value of the parameter

P   - Changed or failed value

R   - Ratio for initial change (R = 10)

S   - Stimulus (S = ±.8, ±.6, ±.4, ±.2, ±0.16, ±0.12,
      ±0.04, ±0.03, ±0.02, ±0.01, ±0.008, ±0.006,
      ±0.004, ±0.002, ±0.0013, ±0.0001, ±0.0005)

To avoid guessing by the subject, the decrease in magnitude, ΔS, was taken for
two steps, resulting in a set of four stimuli (±S_i, ±S_{i+1}). A stimulus
was chosen at random for presentation among these four until all four were
exhausted. Then the next set of four was similarly chosen and presented. The
subject was told only that the magnitude of change would be decreasing in such
a way that it would become progressively harder to detect the change.
Initially, the changes were rather large, S = ±.8 or ±.6, etc.
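
The stimulus schedule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the block of four is taken to be two successive magnitudes with both signs, and the fixed random seed is there only to make the sketch repeatable. It is not the authors' PDP-11 program.

```python
import math
import random

R = 10.0  # ratio for the initial change, as given in the text

def parameter_ratio(s, r=R):
    """P / P_N = exp(ln(R) * S), which is the same as R ** S."""
    return math.exp(math.log(r) * s)

def stimulus_blocks(magnitudes, rng):
    """Yield blocks of four signed stimuli built from two successive magnitudes."""
    for i in range(0, len(magnitudes) - 1, 2):
        block = [sign * m for m in magnitudes[i:i + 2] for sign in (+1, -1)]
        rng.shuffle(block)  # random presentation order within the block
        yield block

magnitudes = [0.8, 0.6, 0.4, 0.2]  # the four largest magnitudes in the list
rng = random.Random(0)             # fixed seed only for repeatability
for block in stimulus_blocks(magnitudes, rng):
    print([(s, round(parameter_ratio(s), 3)) for s in block])
```

With R = 10, a stimulus of S = +.8 multiplies the nominal parameter by about 6.3, while S = -.8 divides it by the same factor, so increases and decreases are symmetric on a logarithmic scale.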

RESULTS 

Figures 1, 2, and 3 show plots of detection time as a function of the
ratio of the parameter change (on a logarithmic scale) for variance, bandwidth
(period), and damping ratio. For later analysis we have plotted detection
times for changes in variance and bandwidth as a function of the change in the
standard deviation of the observed random process. The curves shown in
Figures 4 and 5 are least squares fits to the detection time data.

Threshold values were obtained using a maximum likelihood estimate for
the observed probabilities of correct responses assuming a high-threshold
model for detection. The likelihood function for the observed responses is
given by

$$ L = \prod_i P(C|S_i)^{c_i} \, P(W|S_i)^{w_i} $$

where  P(C|S_i) = probability correct, stimulus S_i
       P(W|S_i) = 1 - P(C|S_i)
       c_i      = number correct, stimulus S_i
       w_i      = number wrong, stimulus S_i

The expression for probability of being correct for the high-threshold-theory
model is

$$ P(C|S_i) = P(C|D)P(D) + P(C|\bar{D})P(\bar{D}) $$

where  P(C|D)       = 1
       P(C|\bar{D}) = H (guessing factor)
       D in detection state, \bar{D} in nondetection state

[...]

where A_i is the stimulus level, μ is the threshold value and σ is the standard
deviation. We found that using A_i = log S_i gave an appropriate Gaussian form
for the stimulus values. (This may be due to the somewhat limited number of
responses at each stimulus level, at the most eight.)
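
A minimal sketch of such a maximum likelihood fit follows. Several things here are assumptions rather than the authors' procedure: the detection probability is taken to be a cumulative Gaussian in the stimulus level (the source formula is not legible), the guessing factor is passed as a parameter with an illustrative default of 0.5, and a coarse grid search stands in for whatever optimizer was actually used. All function names are hypothetical.

```python
import math

def p_detect(a, mu, sigma):
    """Assumed Gaussian form: P(D) = Phi((a - mu) / sigma)."""
    return 0.5 * (1.0 + math.erf((a - mu) / (sigma * math.sqrt(2.0))))

def p_correct(a, mu, sigma, guess):
    """High-threshold model: P(C|S) = 1 * P(D) + guess * (1 - P(D))."""
    pd = p_detect(a, mu, sigma)
    return pd + guess * (1.0 - pd)

def log_likelihood(data, mu, sigma, guess):
    """data holds (stimulus level A_i, number correct c_i, number wrong w_i)."""
    total = 0.0
    for a, c, w in data:
        # Clip to keep the logarithms finite at extreme stimulus levels.
        pc = min(max(p_correct(a, mu, sigma, guess), 1e-12), 1.0 - 1e-12)
        total += c * math.log(pc) + w * math.log(1.0 - pc)
    return total

def fit_threshold(data, mus, sigmas, guess=0.5):
    """Return the (mu, sigma) grid point with the largest likelihood."""
    return max(((mu, s) for mu in mus for s in sigmas),
               key=lambda p: log_likelihood(data, p[0], p[1], guess))

# Hypothetical counts (not data from the experiment), eight trials per level.
data = [(-2.0, 4, 4), (-1.5, 5, 3), (-1.0, 6, 2), (-0.5, 8, 0), (0.0, 8, 0)]
mus = [-2.0 + 0.1 * k for k in range(21)]
sigmas = [0.1, 0.2, 0.3, 0.4, 0.5]
print(fit_threshold(data, mus, sigmas))
```

With at most eight responses per stimulus level the likelihood surface is quite flat, which is consistent with the authors' caution that the resulting thresholds are preliminary.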

The calculated thresholds are shown in Table 1 for the four nominal
random processes. The fit of this threshold model to the data was not
particularly good because of the small number of samples available, so these
values must be considered extremely preliminary. This is due to the small
number of subjects and the measurement technique which we employed to obtain
a rapid determination of threshold. Our subsequent experiments will use
more conventional psychophysical techniques.


DISCUSSION 


The subjects had no difficulty distinguishing between increases and
decreases in bandwidth and variance, i.e., all subjects would agree that an
"increase" had occurred when so indicated by the experimenter. The same was
not true for changes in damping ratio, which is reflected in the wide
variation of detection times as shown in Figure 3. We finally arrived at a
procedure of never explaining that it was an "increase" or "decrease," but just
told the subjects, during the familiarization phase, that they were about to
see a change of one sign or the other; we let the subjects determine whether
it should be considered an "increase" or "decrease."

Examination of Figures 1 and 2 also indicates that there is an asymmetry
in detection times for increases and decreases in variance and bandwidth. A
heuristic explanation for the asymmetry of detection times for changes in
variance is suggested by the "exceedance limits" hypothesis. The subjects learn
what values are "rarely exceeded," which correspond to (say) 2σ levels. The
first time that the display exceeds this level is an indication that the
variance has increased, and thus an increase of variance is indicated by the
subject. On the other hand, when the variance is decreased, it takes the subject
more cycles of the random process to determine that the display is not coming
as close to the exceedance limit as it had before.

This asymmetry seems somewhat surprising if one considers the stimulus
as a change in RMS velocity, because it can be shown that for the random
processes considered here, the standard deviation of display velocity is

$$ \sigma_{\dot{x}} = \omega_n \sigma_x $$

If the concept of a Weber fraction holds, then we have

$$ \frac{\Delta\sigma_{\dot{x}}}{\sigma_{\dot{x}}} = \frac{\Delta\omega_n}{\omega_n} + \frac{\Delta\sigma_x}{\sigma_x} $$

which would not imply any asymmetry in detection times. Such a model would
seem appropriate since Brown (1960) has shown that the relative velocity
threshold is approximately one-tenth of the absolute velocity. On the other
hand, Figures 4 and 5 show that the detection times are reasonably well
described by the change in absolute RMS velocity, rather than its relative
value (a possibility suggested by Capt. R. Gressang).
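
The Weber-fraction relation can be checked with a short derivation. It assumes the standard result for a second-order shaping filter driven by white noise, namely that the RMS display velocity is the natural frequency times the RMS displacement, and it holds to first order in the changes:

```latex
\begin{align*}
\sigma_{\dot{x}} &= \omega_n \sigma_x \\
\ln \sigma_{\dot{x}} &= \ln \omega_n + \ln \sigma_x \\
\frac{\Delta \sigma_{\dot{x}}}{\sigma_{\dot{x}}}
  &\approx \frac{\Delta \omega_n}{\omega_n} + \frac{\Delta \sigma_x}{\sigma_x}
\end{align*}
```

Under this relation a given fractional change in either the natural frequency or the standard deviation produces the same fractional change in RMS velocity, whatever its sign, which is why a pure Weber-fraction model predicts no asymmetry.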

As an alternative to the velocity effects, a normative approach to detect
the changes in parameters would be to examine the residuals of the Kalman
filter designed for the nominal process. If the Kalman filter is operating on
the random process for which it was designed, the residuals will be white
noise. Any deviation of the residuals from the white noise situation indicates
a change has taken place. We have derived the equations for this increment
to the residual autocorrelation function (Curry and Gai, 1976) and show in
Figure 6 some typical changes to the autocorrelation function for changes in
frequency and damping ratio. (In addition to the autocorrelation function
shown, the Kalman filter residual has a white-noise component, i.e., an
impulse at τ = 0 in Figure 6.) Figure 6 shows that an increase in bandwidth
yields a much larger autocorrelation component than does a decrease in
bandwidth of even greater magnitude (measured on a logarithmic scale). Thus it
seems as though the normative Kalman filter model for failure detection may
be a reasonable basis for a descriptive model of human failure detection for
these parameter changes, as well as the change in mean of a random process.
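
The residual-whiteness idea can be illustrated with a deliberately simplified sketch. It assumes a first-order discrete process with a fully observed state, for which the optimal one-step predictor reduces to multiplying the last sample by the nominal coefficient; this stands in for the Kalman filter and is not the second-order process or the equations of the paper. All names and numerical values are hypothetical.

```python
import random

def simulate_ar1(a, n, rng):
    """Simulate x[k+1] = a * x[k] + w[k] with unit-variance white noise w."""
    x = [0.0]
    for _ in range(n):
        x.append(a * x[-1] + rng.gauss(0.0, 1.0))
    return x

def residuals(x, a_nominal):
    """One-step prediction residuals of a predictor designed for a_nominal."""
    return [x[k + 1] - a_nominal * x[k] for k in range(len(x) - 1)]

def lag1_autocorrelation(r):
    """Normalized sample autocorrelation of the residuals at lag 1."""
    mean = sum(r) / len(r)
    d = [v - mean for v in r]
    var = sum(v * v for v in d)
    return sum(d[k] * d[k + 1] for k in range(len(d) - 1)) / var

rng = random.Random(1)  # fixed seed only for repeatability
a_nominal = 0.8
nominal = lag1_autocorrelation(residuals(simulate_ar1(0.8, 20000, rng), a_nominal))
changed = lag1_autocorrelation(residuals(simulate_ar1(0.95, 20000, rng), a_nominal))
print(f"nominal process: {nominal:+.3f}   changed process: {changed:+.3f}")
```

When the process matches the design, the residuals are essentially uncorrelated; after the coefficient changes, the same predictor leaves a clearly nonzero lag-1 correlation, which is the signature a residual-based detector would look for.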
THRESHOLDS

Nominal             T+/T0     T-/T0     σx+/σx0   σx-/σx0

T = 1, ζ = 0.2      1.1549    0.9326    1.1610    0.8097
T = 1, ζ = 0.707    1.3072    0.9165    1.2655    0.7406
T = 3, ζ = 0.2      1.2167    0.9009    1.3917    0.8710
T = 3, ζ = 0.707    1.2070    0.9517    1.1299    0.7227

Table 1: Calculated Thresholds for Period and
Standard Deviation Changes
Figure 3: Detection time vs. damping ratio

Figure 4: Least squares detection time for natural frequency
changes vs. RMS velocity change

Figure 5: Least squares detection time for variance change
vs. RMS velocity change

A NOVEL APPROACH TO THE CROSS-ADAPTIVE AUXILIARY TASK

By  Arye  R.  Ephrath 

National  Research  Council  Associate 
NASA  Ames  Research  Center 


INTRODUCTION 


The technique of the adaptive task was first introduced by Birmingham
(1959) and since used by various investigators in operator-performance studies
([...], 1962; Birmingham et al., 1962; Hudson, 1964; Kelley, 1970). Under
this [...] scheme, the task's difficulty is modulated in real time by the
operator's performance score; as the operator performs more skillfully, his
task becomes more difficult. This technique found useful applications in a
number of areas, such as measuring operator skill, personnel selection and
training, and manual-control system design ([...]).

The cross-adaptive loading task technique is a modification of the
self-adjusting task scheme and allows the investigator to keep the operator's
performance of the primary task constant (Figure 1): the operator's primary
task performance is monitored continuously and compared to a predetermined
desired level; primary task performance which is better than the standard
effects an increase in the difficulty not of the primary task but of a
concomitant loading task, thus keeping the primary task performance constant
by varying the operator's overall workload.

[...], however, it was considered desirable to keep the overall workload
constant at a predetermined level and to study the resulting performance in a
primary piloting task. Consequently, a measuring (non-loading) side task was
used, and it was placed in the feedback path. This concomitant task
continuously measured the operator's workload level, and the deviation of this
measure from the preset standard, properly filtered, modulated the noise
content (and hence, the difficulty) of the primary piloting task (Figure 2).

THE  PRIMARY  TASK 


The investigation utilized a fixed-base ground simulator which was
configured as the cockpit of a deHavilland CC-115 Buffalo. The primary task
consisted of flying a straight, 220° localizer course while descending a
2[...] glide slope in zero-visibility conditions. In performing this task the
subjects relied on the conventional flight instruments - airspeed, attitude,
vertical speed and glideslope-deviation indicators, an altimeter, and an
electronic horizontal-situation display.

Simulated wind gusts were introduced to modulate the difficulty of this
task. The gusts, both horizontal and vertical, were modelled as filtered
noise of constant bandwidth and variable amplitude. This type of disturbance
had been shown (Ephrath, 1975b) to affect the pilot's workload and hence, the
task's difficulty.
THE AUXILIARY TASK


The auxiliary task in this study was meant to measure the subjects'
instantaneous workload without introducing any significant loading of its own.
It consisted of identifying, and responding to, two small red lights which
were mounted above the instrument panel (see Figure 3). One or the other of
these lights illuminated at random intervals, uniformly distributed between
0.5 and 5.0 seconds, and stayed on for two seconds; the subject's task was
to extinguish the illuminated light by moving a toggle switch in the proper
direction - up for the top light, down for the bottom light.

The lights were located 75° to the right of the center of the flight
instruments. Their intensity was very low and was adjusted individually for
each subject via a variable resistance to ensure that the subject would
not be able to detect the lights in his peripheral vision. Consequently,
the subjects could not perform both the primary and the auxiliary task
simultaneously and had to switch continually between the two tasks.

The subjects were instructed to regard the piloting task as their major
responsibility and to respond to the lights only if they felt that they could
do so without sacrificing their piloting performance. Therefore, good
auxiliary task performance was assumed to imply ease of the primary piloting
task (because if the subject was faced with a difficult piloting task, he
could not afford to switch regularly to the low-priority auxiliary task and,
consequently, would often fail to respond to an illuminated light), and
vice versa.


When the subject correctly extinguished an illuminated light, a
"MISS_i = 0" was counted and the subject's response time RT_i was recorded.
In the absence of a correct response the light stayed on for two seconds; a
"MISS_i = 1" was counted and a response time of two seconds was recorded.
A pseudo-instantaneous workload index was updated after the presentation
of each light, utilizing the data of the subject's response to the last
two stimuli:

WLX_i = [75.0 (RT_{i-1} + RT_i) + 125.2 (MISS_{i-1} + MISS_i)] / [...]

which resulted in a workload-index value between 0 and 100. A measuring
auxiliary task of this type has been shown (Ephrath, 1975a and 1975b) to
cause minimal loading of the subject; the particular numerical coefficients
chosen maximize the sensitivity of this workload measure (Spyker et al.,
[...]).

This workload index was compared with the desired workload level; the
error signal was then integrated (to eliminate steady-state position error),
limited (to eliminate excessive gusts and to improve stability) and used to
modulate the amplitude of the simulated gust disturbance (Figure 4).
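
The feedback path just described can be sketched as follows. The sketch makes several assumptions that are not in the source: the divisor of the workload index is taken to be whatever scales the largest attainable numerator to 100, the gust amplitude is in arbitrary units between 0 and 1, and the gain used in the example is illustrative. The names `workload_index` and `GustController` are hypothetical.

```python
RT_MAX = 2.0  # seconds; a missed light is recorded as a two-second response

def workload_index(rt_prev, rt_curr, miss_prev, miss_curr):
    """Workload index from the last two stimuli, scaled to 0..100.

    Assumption: the divisor normalizes the largest attainable numerator.
    """
    numerator = 75.0 * (rt_prev + rt_curr) + 125.2 * (miss_prev + miss_curr)
    max_numerator = 75.0 * (2 * RT_MAX) + 125.2 * 2
    return 100.0 * numerator / max_numerator

class GustController:
    """Integrate the workload error, then limit the gust amplitude."""

    def __init__(self, desired_wlx, gain, amp_min=0.0, amp_max=1.0):
        self.desired_wlx = desired_wlx
        self.gain = gain
        self.amp_min = amp_min    # hypothetical limits, arbitrary units
        self.amp_max = amp_max
        self.integral = 0.0

    def update(self, wlx, dt):
        # Positive error (workload below target) raises the gust amplitude.
        self.integral += self.gain * (self.desired_wlx - wlx) * dt
        return min(self.amp_max, max(self.amp_min, self.integral))

ctrl = GustController(desired_wlx=65.0, gain=0.01)
for rt, miss in [(0.6, 0), (0.9, 0), (2.0, 1), (2.0, 1)]:
    wlx = workload_index(rt, rt, miss, miss)
    print(round(wlx, 1), round(ctrl.update(wlx, dt=1.0), 3))
```

Because the error is integrated, a steady difference between measured and desired workload keeps driving the gust amplitude until the two agree, while the limiter keeps the disturbance within a usable range.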


RESULTS  AND  DISCUSSION 


Each of the professional airline pilots who served as subjects flew 18
simulated approaches in the course of the formal experiment. We were aware
of the fact that they might suspect a causal relationship between their
performance on the light-cancelling auxiliary task and the accompanying changes
in the strength of the gusts. To minimize this possibility, a value of 0.5
was chosen for the integrator gain (Figure 4). This value effected a
noticeable change in the gusts' strength in approximately ten seconds when the
warning lights were completely ignored (implying a workload index of 100); the
time lag was short enough to keep the scheme sensitive to changes in measured
workload, yet apparently it was also sufficiently long to mask the correlation
between gust strength and auxiliary-task performance: none of our subjects
detected the correlation, and neither did the colleagues who served as
preliminary subjects and who might have known better, nor the programmers
who wrote the software (without being aware of its purpose) and who had
spent countless hours flying and debugging the simulator.

Workload data of a typical experimental run are shown in Figure 5. The
workload index was computed from a point 46 seconds after the start of the
simulated approach and is seen to oscillate around the approximate reference
value of WLX. At the point indicated by the arrow a simulated guidance
malfunction occurred (lateral flight director decoupled from the localizer) and
the subject's workload index is seen to rise as he devoted more attention to
the primary flying task. The mean workload index in this particular run was
63.6, compared to a desired (reference) workload index of 65.0. At the
present time, the data of a total of 108 experimental runs are being processed
for more thorough analysis.


CONCLUSIONS 


This paper presents a scheme for modulating the difficulty of a primary
manual-control task by means of an auxiliary task feedback. The method may
find useful applications in operator workload research, in part-task
simulation studies and in personnel training.

In our work, primary task difficulty was controlled by modulating the
noise content of the forcing function, and our results suggest that a stable
workload level may be achieved by proper selection of the subsidiary task to
be used and by proper design of the adaptive logic. Other methods of
controlling the primary task's difficulty - such as changing display and control
gains or varying an unstable mode - may perhaps also be used; the latter of
these has recently been implemented in a system evaluation study (Clement,
1976), apparently with good results.

Finally, this study raised, quite coincidentally, an interesting question
about human perception of the correlation between control inputs and the
output, its dependence on the dynamics of the plant and the effects of the
associated time lags. While this question is basic to the study of human
performance in manual control tasks in areas such as system identification,
adaptation and fault detection, it has never been addressed explicitly, to our
knowledge.

IS THERE AN OPTIMUM WORK-LOAD IN MANUAL CONTROL?

By William L. Verplank
Massachusetts Institute of Technology


SUMMARY 


Experiments on a closed-circuit-television driving simulator were conducted
to answer the question: is there an optimum work-load that (a)
sustains performance in long-term driving and (b) facilitates transitions to
new tasks or emergencies? The results affirm (a) with reservations but not
(b). A second experiment is planned and a new measure for work-load
proposed.

BACKGROUND 

Automation or semi-automation in advanced manual control systems
suggests the question: is it possible for tasks to be too easy? Most human
factors engineering has been aimed at reducing work-load, but can we go too
far? Is there an optimum work-load and what is the appropriate measure?
The concept of arousal and the evidence of vigilance and warm-up decrements
suggest the search for such a work-load measure.

AN  EXPERIMENT 


The task was steering a closed-circuit-television driving simulator
(reference 1). The subjects were instructed to follow as closely as possible
the marked circular course. Three levels of task difficulty were provided
by adding different amounts of disturbance to the input of the steering
servo. This task corresponds roughly to driving in wind gusts. [...] in the
center of the bottom [...] dotted center-line of the "road".

The three conditions were: A: no disturbance, B: a moderate disturbance
consisting of a sum of seven sine-waves with the lower frequencies
predominant, C: a difficult disturbance consisting of a sum of the same
seven sine-waves with the higher frequencies more dominant. The speed of the
car was held constant throughout the experiment.

Each experimental session consisted of 14 laps (approximately 12 minutes)
of one condition followed by 2 laps of condition C, 14 laps at a second
condition followed by 2 laps of condition C, and 14 laps of the third
condition followed by [...]. The subjects were familiarized with each of the
[...]

This  is  an  informal  paper  presented  at: 

12th  Annual  Conference  on  Manual  Control 
University  of  Illinois,  25-27  May,  1976. 

The change in conditions always occurred at the same point on the track.
The whole session lasted approximately 45 minutes. Data were recorded for
three subjects over two or three sessions each, with the conditions presented
in a different order each time. (See Figures 2-a through g for the orders.)

Each session provided data on the six areas of interest: prolonged
performance under three conditions, and adaptation to condition C after these
three different conditions.


Instrumentation


Both speed-control for the vehicle and generation of the disturbance
were provided by a PDP-8 computer (10 samples/second). The position
of the car on the track was recorded on a chart recorder. It was from this
record of position error that performance scores were calculated.


Data 

The absolute value of the position error was read from the chart by eye
at 32 equally spaced intervals for each two-revolution block of data. Thus,
if the standard deviation of error in individual measurements from the chart
were 20% (with no average bias) then the two-lap average absolute position
error is good to [...]. These two-lap average absolute position errors
[...] Figures 2-a through 2-g. The units are centimeters [...] display
(T.V.). Each lap took 51.2 seconds, so the two-lap averages represent
102.4 seconds of data.
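
The precision of a two-lap average can be estimated with the usual standard-error formula. The sketch below assumes the 32 chart readings have independent, unbiased errors, which the source does not state explicitly; the function name is hypothetical.

```python
import math

def standard_error_of_mean(sd_single, n):
    """Standard error of an average of n independent, unbiased readings."""
    return sd_single / math.sqrt(n)

LAP_SECONDS = 51.2        # duration of one lap, from the text
READINGS_PER_BLOCK = 32   # chart readings per two-lap block, from the text

# A 20% error on single readings shrinks to roughly 3.5% on the average.
print(standard_error_of_mean(20.0, READINGS_PER_BLOCK))
# Duration of one two-lap block in seconds.
print(2 * LAP_SECONDS)
```

If the reading errors were correlated (for example, a consistent parallax error on one chart), the average would be less precise than this figure suggests.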


RESULTS 


Hypothesis  I:  Long-term  Performance 
To test the hypothesis that performance will deteriorate with time, the
[...] session.

[...] as the task gets easier (condition A), there will [...] for a
vigilance-type decrement. [...] suggests that either condition A was not
monotonous enough, or the run (12 minutes) not long enough, or for this
particular driving simulation there is no such thing as an optimum task
difficulty.

There does appear to be some difference, however, between conditions A and
B as they relate to long-term performance. As shown in Figure 3 there is
more spread in the changes for A than for B. That is, under condition B,
average performance did not change over the run, but in condition A, it did:
on some sessions it deteriorated over the run, and on some, it improved.
This suggested that if stability of performance is important, condition B
may well be superior to condition A.

This was examined by computing the standard deviation of the two-lap
averages for each 12 minute run. The normalized standard deviations are shown
in Figure 4. Here is one measure that shows the moderate condition B as
[...] the easy condition A (paired t=4.8, df=[...], p<[...]). There is
less variability in performance over the 12 minute run. However, the
difference between conditions B and C disappears.
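
A sketch of the two statistics involved is given below. It assumes that "normalized" means the standard deviation of a run's two-lap averages divided by their mean and that the sample (n - 1) standard deviation is used; the source states neither. The function names and the numbers in the example are hypothetical, not data from the experiment.

```python
import math
import statistics

def normalized_sd(two_lap_averages):
    """Standard deviation of a run's two-lap averages divided by their mean."""
    return statistics.stdev(two_lap_averages) / statistics.mean(two_lap_averages)

def paired_t(first, second):
    """Paired t statistic and degrees of freedom for matched observations."""
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    t = statistics.mean(diffs) / (statistics.stdev(diffs) / math.sqrt(n))
    return t, n - 1

# Hypothetical normalized standard deviations, one pair per session.
condition_a = [0.30, 0.25, 0.35, 0.28]
condition_b = [0.18, 0.16, 0.20, 0.19]
print(paired_t(condition_a, condition_b))
```

Pairing by session removes differences between subjects and days from the comparison, which matters here because the inter-subject variability is large.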

Thus, given the present data, it cannot be shown that a task of moderate
difficulty is superior in long-term performance to both easier and more
difficult tasks. It is, however, superior in certain ways. The moderate task B
did not show a performance decrement over time, whereas the difficult task C
did; the moderate task B showed less relative variability than did the
easier task A.

Hypothesis  II:  Adaptation 

Figure 5 shows error scores on condition C immediately following an
extended run of either condition A, B or C. The two-lap error score is broken
into ¼-lap scores. Figure 5-a shows the score for the first ¼-lap (12.8
seconds); 5-b averages the score for the first ¼-lap with that for the
second (average for first 25.6 seconds). Similarly 5-c and 5-d are averages
of all the performance up to the specified points.

Hypothesis II would suggest that initial performance on the new task
would differ with different preparatory conditions, and that upon adaptation
to the same new task, performance would equalize. Just the opposite appears
to be the case. The preparatory conditions cannot be distinguished on the
first ¼-lap score - only on the entire two-lap averages (5-d).

It is difficult to draw conclusions from the data because of the large
inter-subject variability. In the two-lap averages, the only significant
difference is for subjects JK and JM, between conditions B and C (paired
t=5.16, df=4, p<.01). An alternative to Hypothesis II is proposed
to explain this difference, and depends on a detail of the experimental
procedure.

The experiment compared performance in adapting to condition C following
an extended run of either A, B or C. If the "adaptation" block was followed
by the extended run of C, this "adaptation" block was counted as the first
block in that extended run. For "adaptation" following condition C, the block
examined was simply the last block in the extended run. This may explain the
similarity between Figures 5-d and 3, at least as far as condition C is
concerned. Figure 3 shows the difference between the beginning and end of an
extended run. For condition C, it shows a significant decrement in performance.
Similarly, Figure 5-d shows the averages at the beginning of a run (C
following A or B) and at the end of a run (C following C).

There are two reasons then that the data do not support Hypothesis II.
First, if it is truly a process of adaptation, the difference in performance
should be greatest at the beginning of the new condition. The only
significant difference is in the direction predicted by Hypothesis II (C worse
than B), but it is not at the beginning, showing up only in the two-lap average.
Second, this difference can be accounted for by an alternative to Hypothesis
II. Another possibility is that a different measure of the "preconditioning"
work-load could explain the differences in adaptation.


SUMMARY 


Two hypotheses were proposed relating the concept of arousal and the
evidence of vigilance and warm-up decrements to automobile driving. There
should be an optimum work-load that sustains performance on long tasks, and
facilitates transition to new tasks.

A  simulation  of  automobile  driving  under  three  levels  of  difficulty 
showed  a  moderate  disturbance  caused  less  decrement  than  a  difficult  distur¬ 
bance  and  less  variability  than  no  disturbance.  No  condition  was  superior  at 
facilitating  transitions. 

The present evidence is not very strong support for the concept of an
optimum work-load. Performance decrements in the "under-load" condition probably
cannot be expected over such short runs. A second experiment with longer
runs is proposed. The question of what constitutes optimality is also open.

Finally, the measure of work-load ought to include a kind of "self-loading"
or "operator-induced work-load", especially in the case where there is
no external disturbance and the operator is "tracking his own noise". The
second experiment and a new measure of work-load will be reported in a forthcoming
Ph.D. thesis [2].

Figure 1. The M.I.T. C.C.T.V. Driving Simulator


TASK INTERFERENCE IN MULTI-AXIS AIRCRAFT STABILIZATION

By  Edward  D.  Onstott 

Controls  Research 

Northrop  Corporation 
Aircraft  Division 
Hawthorne,  California 


ABSTRACT 


A time domain attention allocating multi-axis pilot model has been used to
examine task interference effects in a two-axis attitude stabilization task in turbulence.
Configurations from a matrix of uncoupled lateral and longitudinal F-5
dynamics were analyzed to determine the influence of each axis on the control performance
of the other. The analysis showed that an inappropriate choice of longitudinal
dynamics would deteriorate the roll stabilization with no change in the lateral
dynamics or pilot model. Furthermore, the influence of roll versus pitch angle display
scalings was analyzed by hypothesizing that the task urgencies used to allocate
attention in the pilot model should be weighted by the scaling factors. A fixed base
simulation verified the model, the resonant task interference, and the scaling hypothesis;
further validation through manned simulation was performed using a complete
nonlinear YF-17 aircraft model and the Northrop Large Amplitude Simulator.


INTRODUCTION 


Failure to match the dynamics of the pilot and the aircraft in an optimum way
may result in not achieving the best performance of both. The result of such dynamic
mismatch is greater pilot workload, less accurate weapon delivery, riskier approach
and landing, and increased development cost deriving from overdesign in an attempt
to achieve better pilot acceptance. Furthermore, the lack of precise methods for
analytically evaluating the dynamic pilot-aircraft interface in the preliminary design
phase may lead to extensive and unnecessary flight simulation. The analytic ability
to recognize and correct significant control shortcomings as well as to identify potential
improvements is important at a time when basic design concepts can be freely
adapted to new requirements.

This analytical requirement has been partially met for decades by the use of
mathematical models of human pilot dynamics. These techniques have been successfully
employed in the development of simple low authority control systems where
greatly simplified representations of the aircraft and the pilot could be tolerated
(References 1-3). However, the advent of advanced tactical aircraft with the availability
of high authority, high order control concepts and the associated demands for
superior piloted performance requires pilot model aircraft analysis technology that
is considerably more general.

In order to meet the current requirements of flying qualities prediction and
evaluation, Northrop has evolved a flying qualities analysis procedure including complete
generality of the nonlinear aerodynamics and control system. Digital simulation
has been adopted as an over-all context for the dynamic representation and computation.
With the availability of point-by-point generation of the flight history, pilot
models can be developed that take into account not only the statistical features exploited
by describing function or optimal control methods, but specific dynamic and
decision processes as well. Northrop has validated this digital simulation approach
to time domain decision models which generate the characteristics of multi-axis
piloted flight (References 4 and 5). In addition to the increased system generality, an
important consequence has been the discovery of multi-axis task interference effects
that have never before been identified or analytically computed. The objective of this
paper is to present these results.


ATTENTION  ALLOCATION  AND  THE  MULTI-AXIS  HUMAN  CONTROLLER 


A great deal is known about the dynamics of the human pilot performing continuous
linear single axis tasks. Much work has gone into developing models that match
the amplitude and phase characteristics of the pilot's output at the controller, and
many aspects of the internal structure of the human have been analyzed. These
"ultra-precise" models are of use in solving many human factors problems about the
interface between pilot and controller, but for the basic objective of determining the
total system dynamics, it is usually sufficient to employ simple models that consist
of gain $K_p$, lead $T_L$, time delay $\tau$, and possible lag $T_I$:

$$K_p \, \frac{(T_L s + 1)\, e^{-\tau s}}{(T_I s + 1)} \tag{1}$$


The  more  exact  pilot  models  can  certainly  be  employed,  but  for  most  purposes  the 
above  simple  model  gives  good  statistical  results. 
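
As an illustration only, the simple model of equation (1) can be evaluated along the imaginary axis to see the gain and phase it imposes at a given frequency. The sketch below is not taken from the paper: the function names are hypothetical, the lead of 0.5 sec and delay of 0.3 sec are the values the paper quotes later for Class IV aircraft, and the unit gain and zero lag are placeholder assumptions.

```python
import cmath
import math


def pilot_response(omega, gain, lead, delay, lag=0.0):
    """Evaluate K (T_L s + 1) exp(-tau s) / (T_I s + 1) at s = j*omega."""
    s = 1j * omega
    return gain * (lead * s + 1.0) * cmath.exp(-delay * s) / (lag * s + 1.0)


def magnitude_db_and_phase_deg(response):
    """Convert a complex response to (magnitude in dB, wrapped phase in degrees)."""
    return 20.0 * math.log10(abs(response)), math.degrees(cmath.phase(response))


if __name__ == "__main__":
    # lead = 0.5 s and delay = 0.3 s are quoted in the paper;
    # gain = 1.0 and lag = 0.0 are assumptions made for this sketch.
    for omega in (0.5, 1.0, 2.0, 4.0):
        h = pilot_response(omega, gain=1.0, lead=0.5, delay=0.3)
        mag_db, phase_deg = magnitude_db_and_phase_deg(h)
        print(f"omega={omega:4.1f} rad/s  mag={mag_db:6.2f} dB  phase={phase_deg:7.2f} deg")
```

The time delay changes only the phase, so the magnitude printed here comes entirely from the gain, lead and lag terms.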


There  have  been  three  main  approaches  previously  taken  in  attempts  to  extend 
single-axis  model  theory  to  multi-axis  tasks.  All  of  these  recognize  that  the  human 
must  operate  as  a  time  shared  device  when  faced  with  difficult  control  tasks  on  several 
independent  axes.  This  shifting  attention  allocation  degrades  the  performance  of  each 
axis  from  what  the  pilot  would  achieve  in  continuous  control.  As  might  be  expected, 
these  three  approaches  are  1)  decrease  the  model  gain  from  the  optimum  for  contin¬ 
uous  control,  2)  increase  the  time  delay  to  account  for  the  periods  of  inattention,  and 
3)  inject  filtered  noise  to  imitate  the  spectral  content  of  the  shifting  pilot  control. 

The shortcoming of these approaches is this: the human pilot is quite discriminating
about when he will abandon the control of one task to take over the control
of another. This leads to a pilot sampling criterion that is functionally dependent on
the total system variables. In no way can this be regarded as a purely random, or a
regular sampling. Thus a multi-axis pilot model must contain an algorithm that determines
when attention shifting takes place, and the model must be computed in a
way that preserves this information. Recently, Northrop developed a multi-axis
pilot model which does just that: the urgency function model.

By using the method of digital simulation, the exact functional criterion by
which a pilot decides his control can be directly computed without the gross distortions
of linearization. The development of the form of these urgency criteria has
now advanced to the point where they can be determined from 1) the system dynamics,
2) the task, and 3) the appropriate human factor information about the pilot.

Let $x_i$ be the state variables of one axis, x, of a two-axis task, and let the
other axis, y, be represented by $y_i$. Then the attention allocation criterion for the
x axis is satisfied identically with the inequality

$$U_x(x_i) > U_y(y_i) \tag{2}$$

where $U_x$ and $U_y$ are the urgency functions of the x and the y tasks. These functions
are always nonlinear in the state variables, but fall into several precise classes.
Some of these classes have been well explored, and a tabulation of the urgency functions
for attitude stabilization is included below.

The  multi-axis  urgency  function  model  thus  consists  of  simple  linear  dynamics, 
equation  (1),  along  with  the  control  criterion  of  (2).  Whichever  axis  has  the  larger 
urgency  function  gets  the  corrective  control  attention.  The  adjustment  of  the  linear 
coefficients  can  usually  be  obtained  by  an  easy  search  starting  with  the  optimum 
single  axis  coefficients.  Almost  always,  the  optimum  multi-axis  coefficients  differ 
significantly  from  these  values,  especially  in  multiloop  control  on  the  two  axes. 
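
The switching rule described above, in which whichever axis has the larger urgency function receives the corrective control, can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the Northrop implementation: the function names and the two example urgency functions (a weight times the magnitude of the first state variable) are hypothetical.

```python
from typing import Callable, Sequence

# An urgency function maps the state variables of one axis to a scalar urgency.
UrgencyFunction = Callable[[Sequence[float]], float]


def attended_axis(x_state: Sequence[float], y_state: Sequence[float],
                  urgency_x: UrgencyFunction, urgency_y: UrgencyFunction) -> str:
    """Return "x" when U_x(x_i) > U_y(y_i), otherwise "y"."""
    return "x" if urgency_x(x_state) > urgency_y(y_state) else "y"


def example_urgency_x(state: Sequence[float]) -> float:
    """Hypothetical urgency: unit weight on the magnitude of the first state."""
    return 1.0 * abs(state[0])


def example_urgency_y(state: Sequence[float]) -> float:
    """Hypothetical urgency: weight of 2 on the magnitude of the first state."""
    return 2.0 * abs(state[0])


if __name__ == "__main__":
    # U_x = 3.0 and U_y = 2.0, so the x axis receives the control attention.
    print(attended_axis([3.0], [1.0], example_urgency_x, example_urgency_y))
```

With the strict inequality used here, a tie in urgency leaves attention on the y axis; that tie-breaking choice is an assumption of the sketch.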


TWO-AXIS  ATTITUDE  STABILIZATION  IN  TURBULENCE 


It is natural to inquire about applications of the Northrop urgency model to representative
tactical fighter dynamics. Since attitude stabilization in the presence of
low-level turbulence is an almost ever-present requirement of any Class IV mission,
this task was selected for both analytical and experimental study using degraded F-5
aircraft models. The primary objective was to verify the time-sharing of the human
pilot, the appropriateness of the urgency function attention allocation algorithm, and
the statistical accuracy of the predictions. In order to control the number of parameters
involved in optimizing the model, display and controller effects were minimized
in the flight simulation. This involved using a large CRT display so that display
motions were amplified enough to eliminate visual threshold effects. Recently a general
study of this problem, including visual threshold effects, has been completed
using the general YF-17 aircraft model. The results of this problem are also
discussed.

The  simulation  display  consisted  of  an  illuminated  dot  against  a  dimly  illumi¬ 
nated  grid.  Vertical  displacement  of  the  dot  represented  pitch  angle,  and  lateral 
displacement  bank  angle  as  shown  in  Figure  1. 

FIGURE 1. TWO-AXIS FLIGHT SIMULATION DISPLAY

The actual scaling of the dot displacement was achieved experimentally by
going to the most sensitive scope setting which would accommodate all dot excursions.
These scalings, μ and α, had units of degrees per centimeter of dot deflection, and
provision was made to simulate at various values of the μ to α ratio. As this μ/α ratio
increased from zero (continuous lateral tracking only), the pilot was forced to pay
greater and greater attention to the longitudinal task. One of the most interesting
hypotheses tested was the manner in which this took place. In all flight simulation
tests the pilot was instructed to minimize the rms distance of the dot from the center
of the display. This is called the radial error, denoted by r(φ, θ), and is given by

$$r(\phi, \theta) = \sqrt{(\alpha\phi)^2 + (\mu\theta)^2} \tag{3}$$


In this way the μ/α ratio greatly influences the objective urgency, and the hypothesis
tested is that the objective urgency as determined by the μ/α ratio scales
exactly as the ratio of the urgency functions of the multi-axis pilot model $U_P$ and $U_R$,
that is,

$$\mu/\alpha = U_P/U_R \tag{4}$$

Limitations in display resolution allowed only the range of values shown in
Figure 2.

FIGURE 2. URGENCY RATIOS TESTED

It should also be remarked that the aircraft dynamics and data reduction of the
flight simulation were digitally generated. A frame time of 0.05 second was used,
and the turbulence was generated using a digital random noise source with a 0.15 kHz
bandwidth, thus assuring independence of the two gust time histories produced by the
digital filters. The equations of motion and all dynamic computations were programmed
exactly as in the analytical digital simulation model described below. It
should be noted that the method is in no way restricted to linearized equations of
motion. In the YF-17 study general nonlinear six-degree-of-freedom equations were
used with no change in the pilot model or its use. Three degrees of freedom were
mechanized laterally, while the u equation was not required longitudinally, thus giving
five degrees of freedom in all. The equations of motion are shown in Figure 3.


[Figure 3 gives the equations of motion, equations (5) through (11), for the
longitudinal states w, q, θ and the lateral states r, β, p, φ; the kinematic relations
are $\dot{\theta} = q$ (7) and $\dot{\phi} = p$.]

FIGURE 3. EQUATIONS OF MOTION

The augmentation of each configuration was included in the stability derivatives, since
washout is not required for a zero mean small perturbation attitude stabilization task.

The task proved difficult to fly, and about 10 hours were required for asymptotic
training. Data were collected for test periods of 30 seconds, and simulation sessions
were held to two hours maximum with frequent rest periods. In addition to the digital
data reduction of tracking statistics, strip chart recordings were collected for each
configuration and each μ/α ratio tested. In order to provide a variety of F-5 configurations,
three longitudinal and two lateral sets of dynamics were selected and combined
in a matrix of six configurations, designated as shown in Figure 4.

The turbulence simulation used Dryden spectra obtained by filtering Gaussian
white noise as described in MIL-F-8785B, where an air speed of 718 fps and an altitude
of 1750 feet were used. In order to diminish the effects of drift, a 40-db/decade
high-pass prefilter was used on the random number noise source, with the break
point set at 0.3 radian. The hand controller was mounted on the side of a chair and
had light breakout and gradient forces; both the sensitivity and the polarity were selected
by the subjects.
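
The idea of shaping Gaussian white noise into a gust time history can be sketched with a first-order filter. The code below is a hedged illustration, not the filters used in the study: it implements only a first-order exponentially correlated process (the shape of the Dryden longitudinal-gust form), whereas the lateral and vertical gust filters of the specification are second order and are not reproduced here, and the high-pass prefilter is omitted. The air speed of 718 fps and the 0.05 second frame time come from the paper; the gust intensity, the scale length and the function name are assumptions.

```python
import math
import random


def gust_sequence(sigma, scale_length, airspeed, dt, steps, seed=0):
    """Generate a first-order exponentially correlated gust sequence.

    sigma        rms gust intensity (assumed value, ft/s)
    scale_length turbulence scale length (assumed value, ft)
    airspeed     true air speed (ft/s)
    dt           frame time (s)
    """
    rng = random.Random(seed)
    # Exact discretization of a first-order lag with time constant L/V.
    a = math.exp(-dt * airspeed / scale_length)
    # Driving-noise scale chosen so the stationary variance equals sigma**2.
    drive = sigma * math.sqrt(1.0 - a * a)
    gust = rng.gauss(0.0, sigma)  # start in the stationary distribution
    sequence = []
    for _ in range(steps):
        sequence.append(gust)
        gust = a * gust + drive * rng.gauss(0.0, 1.0)
    return sequence


if __name__ == "__main__":
    gusts = gust_sequence(sigma=5.0, scale_length=1750.0, airspeed=718.0,
                          dt=0.05, steps=600)
    rms = math.sqrt(sum(g * g for g in gusts) / len(gusts))
    print(f"30-second sample rms gust: {rms:.2f} ft/s")
```

Because a 30-second sample is short compared with the statistics of the process, the sample rms varies noticeably from run to run, which is the kind of run-to-run variation in turbulence level that the paper discusses for the 30-second flights.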

FIGURE 4. F-5 CONFIGURATION DESIGNATION


Analytically, this problem was studied by means of digital simulation. This
system model includes the six-degree-of-freedom linear or nonlinear airframe equations
of motion, turbulence and command tracking generation, linear pilot model gain,
lead, lag and delay, urgency function switching algorithms, pilot remnant, pilot
inadvertent crossfeed, rms statistics, histograms of control episodes, and urgency
function delay.

Since the turbulence provided a large disturbance to the system, it was not
necessary to model remnant and inadvertent crossfeed. The commands to the system
are simply θ = 0 and φ = 0.

In order to validate the model as a predictive method, the prediction algorithm
should be understood. The various constants and functional forms are identical to
those used in the earlier validation examples (Reference 4). The model takes the
following form for two-axis multiloop attitude control:

1.  Assume  a  linear  model  on  each  axis  with  a  gain,  lead  (for  Class  IV 
aircraft,  0.5  sec),  and  delay  of  0.3  sec  (this  includes  neuromuscular 
lag  which  can  be  modeled  separately  if  desired). 

2.  Assume  continuous  control  in  single-axis  tasks,  split  control  in  multi- 
axis  tasks.  For  a  two-axis  task,  the  pilot  model  will  switch  from  one 
axis  to  the  other  based  on  testing  the  urgency  functions: 

P  axis  controlled  if  and  only  if  Up  >  UR 


3.  The  urgency  functions  take  the  form 


(12) 


where  0  is  equal  to  zero  in  single  loop  multi-axis  tasks  (this  is  the  case 
in  the  two-axis  turbulence  problem). 

4.  The quantity α and its counterpart μ on the other axis are determined by
the display. (This is the hypothesis mentioned above: that subjective and
objective urgency must agree.)

5.  Given  the  above,  the  model  is  optimized  in  the  following  manner: 

a.  Optimize  each  pilot  model  gain  for  a  continuous  single-axis  task. 

b.  Use these gains to optimize the radial error r(φ, θ)

$$r(\phi, \theta) = \sqrt{(\alpha\phi)^2 + (\mu\theta)^2} \tag{13}$$

by perturbing α and its counterpart μ.

c.  The gains should again be checked since it often occurs that optimum
multi-axis gains are somewhat lower than single-axis optimum gains.

6.  With this procedure complete, the model can be exercised to produce time
histories of all dynamic quantities, all statistics of mean, rms, control
episode periods, dwell fractions, and other parameters which serve as predictions
of the performance of an asymptotically trained human controller
for any system disturbance to be modeled such as pilot perception threshold,
turbulence, or commands, both continuous and discrete, Gaussian and non-Gaussian.


For  the  two-axis  attitude  stabilization  task  in  turbulence,  the  above  algorithm 
simply  requires  optimizing  two  quantities  (the  pilot  model  gains)  in  the  separate  con¬ 
tinuous  single  axis  tasks.  Hence 


$$\begin{aligned}
\text{Longitudinal Urgency:}\quad & U_P = \mu\,|\theta| \\
\text{Lateral Urgency:}\quad & U_R = \alpha\,|\phi|
\end{aligned} \tag{14}$$
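
To make the time-sharing behaviour concrete, the following sketch runs a toy two-axis loop in which the pitch axis is controlled only when its display-weighted urgency exceeds that of the roll axis. It rests on several assumptions that are not from the paper: each axis is a pure integrator driven by the control and by unit white noise, the pilot is a pure gain with no lead or delay, the urgencies are taken as the weighted absolute attitude errors μ|θ| and α|φ|, and the radial error is taken as the rms of sqrt((αφ)² + (μθ)²). Only the 0.05 second frame time is taken from the text; all names are hypothetical.

```python
import math
import random


def simulate_two_axis(mu, alpha, gain=2.0, dt=0.05, steps=4000, seed=1):
    """Return (pitch dwell fraction, rms display-weighted radial error)."""
    rng = random.Random(seed)
    theta = 0.0  # pitch attitude error
    phi = 0.0    # roll attitude error
    pitch_steps = 0
    sum_sq = 0.0
    for _ in range(steps):
        # Urgency test: pitch is attended if and only if U_P > U_R.
        pitch_attended = mu * abs(theta) > alpha * abs(phi)
        if pitch_attended:
            pitch_steps += 1
            theta_cmd, phi_cmd = -gain * theta, 0.0
        else:
            theta_cmd, phi_cmd = 0.0, -gain * phi
        # Toy integrator plants with independent disturbances on each axis.
        theta += dt * (theta_cmd + rng.gauss(0.0, 1.0))
        phi += dt * (phi_cmd + rng.gauss(0.0, 1.0))
        sum_sq += (alpha * phi) ** 2 + (mu * theta) ** 2
    return pitch_steps / steps, math.sqrt(sum_sq / steps)


if __name__ == "__main__":
    for ratio in (0.0, 1.0, 8.0, 16.0):
        dwell, radial = simulate_two_axis(mu=ratio, alpha=1.0)
        print(f"mu/alpha={ratio:5.1f}  pitch dwell={dwell:.2f}  radial rms={radial:.2f}")
```

At a ratio of zero the pitch urgency is always zero, so the sketch reduces to continuous lateral tracking, matching the limiting case described in the text.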


Before examining the quantitative agreement of the model with the flight simulation,
certain qualitative similarities with the data will be discussed. Although there
has been a wide and rapidly growing recognition that the human pilot behaves as a time
sharing device, the sharp division between lateral and longitudinal control has not
often been directly exhibited. Two types of examples will be shown. The first illustration
is a trace taken during the simulation of lateral versus longitudinal stick deflection,
Figure 5. Notice that nearly all stick motion is parallel to either the lateral
or the longitudinal control axes. Also note that these traces show that the pilot spent
most of his time performing lateral corrections (which is consistent with the dwell
fractions computed for this μ/α ratio of 8/1, approximately 0.75 lateral control). The
second and most telling illustration of the pilot's switching from one control axis to
the other is shown in Figure 6. Here the actual strip chart record is reproduced
for a twenty-second sample of the two-axis task. The control episodes have been
marked with a step function showing the pilot's apparent control shifting. If the reader
will follow the entire record starting at time zero, the left hand side, he can compare
the alternation of the control as indicated. Furthermore, if the φ and θ time series
are compared, it is clear why control shifting takes place when it does.
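
Dwell fractions and control episode periods of the kind quoted above can be computed directly from a record of which axis was being controlled at each frame. The sketch below assumes such a record is available as a list of labels ("R" for lateral, "P" for longitudinal); the labels, function names and the example record are hypothetical, and only the 0.05 second frame time is from the paper.

```python
from itertools import groupby


def dwell_fractions(record):
    """Fraction of frames spent controlling each axis."""
    total = len(record)
    return {axis: record.count(axis) / total for axis in sorted(set(record))}


def control_episodes(record, dt):
    """List of (axis, duration in seconds) for each uninterrupted control episode."""
    return [(axis, len(list(run)) * dt) for axis, run in groupby(record)]


if __name__ == "__main__":
    # Hypothetical eight-frame record: three lateral frames, one longitudinal, repeated.
    record = ["R", "R", "R", "P", "R", "R", "R", "P"]
    print(dwell_fractions(record))
    print(control_episodes(record, dt=0.05))
```

For this made-up record the lateral dwell fraction is 0.75, the same value the text reports for the μ/α ratio of 8/1, although the record itself is not data from the study.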

[...]ations produce variations in the actual turbulence level for any given data run, in both
the model and the flight simulation.

The first data to be considered are an overall comparison of tracking error
predictions with the simulation data. Figure 7 shows a plot of longitudinal θ simulation
data versus model predictions. The simulator data are averaged from the 30-second
tests for the continuous, μ/α = 8, and μ/α = 16 tasks, while the model data were
obtained from time histories spanning 1000 seconds. Figure 8 gives a similar comparison
for the lateral φ data. The agreement here is slightly less good for the
μ/α = 16 cases owing to experimental display constraints; at this urgency ratio, the
motions sometimes went off the display. In particular, note the extremely accurate
predictions for the single-axis task (squares). Since the pilot was instructed to
minimize the rms distance of the dot from the center of the display, it is of value to
compare the radial error data. This is shown for the two-axis data in Figure 9. It
should be remarked that this agreement between model predictions and simulation
data for a two-axis task rivals currently reported results in many single axis
studies.


FIGURE 7. MODEL VERSUS FLIGHT SIMULATION DATA FOR ALL
LONGITUDINAL TASKS


FIGURE 8. MODEL VERSUS FLIGHT SIMULATION DATA FOR ALL
LATERAL TASKS


FIGURE 9. RADIAL ERROR OF MODEL VERSUS FLIGHT SIMULATION FOR
TWO-AXIS ATTITUDE STABILIZATION


It is useful to examine the individual configurations. A particular type of diagram
was evolved that portrays the two-axis [...] longitudinal tracking error. In
order to see how a particular μ/α ratio compares with the open-loop aircraft response
and with the [...] tracking performance, these values have been drawn as lines
on the diagram as, for example, the line labeled "open loop". Thus the two-axis
performance would be expected to fall somewhere in the rectangle bounded [...].
For an urgency ratio of zero, i.e., continuous [...], the prediction is the lower
right-hand corner of the rectangle. [...] are shown and labeled with the μ/α ratio. The
averaged model data are also shown and labeled.

By referring [...] significant finding [...]
matching are important aspects of two-axis handling qualities that are seen to exist
experimentally, and are subjects that can be accurately predicted and evaluated by
the two-axis urgency function pilot model.

A further point to observe is that even in the configurations where the lateral
tracking error is diminished in the μ/α = 16 tasks, the data still fall along the lines
of model predictions regarded as interpolations of the urgency ratios. This indicates
that when the display limit forced additional attention to the lateral task, the pilot
simply shifted his subjective urgency, i.e., his actual weighting of φ and θ, thus displacing
the data along the urgency interpolation curves. In the other cases where the
display limit was not encountered, the data fall close to the curves. This indicates
two things. First, the pilot's subjective urgency and the objective urgency of the
actual display gains closely agree; second, the greatest source of model prediction
inaccuracy is a failure of these two urgencies precisely to match up owing to experimental
constraints not modeled in this example.

There  is  one  further  statistical  comparison  that  has  quite  important  conse¬ 
quences  to  aircraft  design  and  air  safety,  namely,  the  variation  in  tracking  error  as 
shown  in  the  performance  data  dispersion  of  the  30-second  simulations.  The  follow¬ 
ing  Figures  13  and  14  show  these  30-second  data  for  two-axis  flight  simulation  data 
as  well  as  30-second  model  data  for  a  comparable  number  of  flights.  The  significant 
feature  of  two-axis  handling  qualities  that  these  figures  exhibit  is  the  propensity  of 
certain  configurations  to  large  excursions  and  tracking  error,  while  others  appear 
more  uniform.  This  demonstrates  a  varying  sensitivity  to  fluctuations  in  the  effec¬ 
tive  turbulence  power  spectrum  and  probability  density  during  the  finite  duration 
flight  histories.  Here,  the  model  and  the  flight  simulation  show  a  large  dispersion, 
with  many  flights  performing  much  worse  laterally  than  open-loop  response.  These 


FIGURE 13. MODEL AND SIMULATION DATA DISPERSION FOR 30 SEC FLIGHTS
F-5 IB, μ/α = 16

FIGURE 14. MODEL AND SIMULATION DATA DISPERSION FOR 30 SEC FLIGHTS

effects indicate that not only do certain combinations of lateral and longitudinal
dynamics lead to deteriorated [...] performance
as well. The actual attitude excursions encountered in examples such as IB
could, in an actual flight, be severe enough to lead to loss of control. This effect is
totally due to the two-axis time sharing of the human controller, since such sensitivity
is not observed in the continuous single-axis [...] configurations involving
the lateral or longitudinal dynamics that together produce these examples. Considering
IB further, Figure 15 [...] of 225 seconds lateral
tracking [...] continuous case, there is a total absence of large excursions.
[...] large fluctuations occur in local tracking performance
as well as numerous severe sudden excursions.


TWO-AXIS ATTITUDE STABILIZATION OF THE YF-17 IN TURBULENCE


In April [...] objectives of this study:

•  Demonstrate the generality of the Northrop pilot-aircraft model
•  Validate the model for external visual flying
•  Evaluate the importance of modeling visual deadbands.

FIGURE 15. LATERAL TIME HISTORIES FOR SINGLE AND TWO-AXIS
TURBULENCE TRACKING OF B LATERAL DYNAMICS

The aircraft model included all nonlinear aerodynamics terms and parametric
table look-ups as well as the nonlinear control system including automatic flap setting.
The cockpit was used with no change from the YF-17 configuration, including stick
gradients and all functioning instruments. Dryden turbulence was generated for
lateral v and longitudinal w gusts, and stabilization flights were of thirty-second
duration.

In order to incorporate visual deadband in a realistic manner, the wide-angle
visual-system earth-sky projector projected a horizon of mountainous terrain. By
having the pilot (a former U.S. Navy test pilot) attempt to hold zero roll angle, the
presence of visual deadband could be both demonstrated and measured from the strip
chart recordings. This deadband then was mechanized into the pilot model as an
inequality test, not as injected white noise. In this way, the model of the deadband
preserved the dynamic characteristics of the actual flight simulation as well as leading to
the correct statistics.
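
A deadband mechanized as an inequality test, rather than as injected noise, can be sketched as follows. This is an assumed form for illustration: the pilot model acts on the roll error only when its magnitude exceeds the threshold and otherwise sees zero error. The 3 degree value is the deadband the paper reports for the lateral model; the function names and the pure-gain control law are hypothetical.

```python
def perceived_error(error_deg, deadband_deg=3.0):
    """Return the error the pilot model acts on: zero inside the visual deadband."""
    return error_deg if abs(error_deg) > deadband_deg else 0.0


def lateral_command(roll_error_deg, gain, deadband_deg=3.0):
    """Hypothetical pure-gain roll correction applied to the perceived error."""
    return -gain * perceived_error(roll_error_deg, deadband_deg)


if __name__ == "__main__":
    for error in (-5.0, -2.0, 0.5, 3.0, 4.0):
        print(f"roll error {error:5.1f} deg -> command {lateral_command(error, gain=0.5):5.2f}")
```

Because the test is deterministic, the model issues no correction while the roll error stays inside the deadband, so the attitude can drift there without response; injected noise would not reproduce that feature of the time histories.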

The model of the piloted YF-17 was exercised by inserting the pilot model into
the simulator computers, a small modification. Flight simulations were completed
for single- and two-axis tasks at six flight conditions as shown in Figure 16.

FIGURE 16. YF-17 FLIGHT CONDITIONS STUDIED

Plots of model versus flight simulation for the single-axis tasks are given in Figures
17 and 18, where the lateral model data have been completed using the deadband,
3 degrees. The data for the two-axis tasks are shown in Figures 19 and 20. The
close agreement of these data between the model and the flight simulation validates
the model including the visual deadband. Computations of the model without the
deadband agree poorly for the flight conditions of high dynamic pressure where the
turbulence disturbances are low, being roughly one-fifth to one-quarter of the deadband
model values. The effects of dispersion between the data of the thirty-second
flights identified in the study of the F-5 in turbulence also show in the flight conditions
of the YF-17 at low dynamic pressure. The data from the individual flights are presented
for the six flight conditions in Figures 21 through 26.

FIGURE 17. LATERAL TURBULENCE TRACKING OF THE YF-17 SINGLE AXIS

FIGURE 20. LONGITUDINAL TURBULENCE TRACKING OF THE YF-17 TWO-AXIS

FIGURE 22. TWO-AXIS TRACKING DATA FOR 30,000 FT AT MACH 0.4

FIGURE 23. TWO-AXIS TRACKING DATA FOR 5,000 FT AT MACH 0.8

FIGURE 24. TWO-AXIS TRACKING DATA FOR 40,000 FT AT MACH 0.8

FIGURE 26. TWO-AXIS TRACKING DATA FOR 10,000 FT AT MACH 1.1


APPLICATIONS


Since  these  methods  of  digital  simulation  and  multi-axis  pilot  models  allow  large 
scale  maneuvers  to  be  studied  as  the  closed  loop  nonlinear  and  time-varying  problems 
that  they  are,  there  are  a  large  number  of  applications  which  can  be  undertaken. 
Northrop  is  currently  working  on  several  major  examples,  and  further  work  will  in¬ 
clude  the  following  problem  areas. 


Weapon  Delivery 

Both air-to-air and air-to-ground problems can be studied. In dive bombing,
for example, the analysis includes target acquisition, roll-in, acquisition of the glide
slope tracking using the weapon sight, intense but briefly unstable attitude rate
stabilization prior to weapon release, and pull up. The entire maneuver can be digitally
"flown" many times in the presence of the realistic environment of low level
turbulence and pilot induced system noise, and the statistics of impact error produced.


Landing  Approach 

The aerodynamics of landing approach are not well represented by constant
coefficient linear uncoupled dynamics. Furthermore, the landing task in the presence
of low level turbulence makes the use of the exact nonlinear models of the problem
necessary. Much has been learned from linear methods about the form of the control
strategy that the pilot must use, and the study of the behavior of the multi-axis pilot
model can utilize this information to produce a comprehensive analysis of the entire
maneuver.


Coordination  of  Analysis  and  Flight  Simulation 

The  multi-axis  pilot  model  technologies  developed  at  Northrop  will  not  replace 
piloted  flight  simulation,  but  in  fact  can  be  used  to  greatly  improve  its  efficiency.  Not 
only  will  consistent  analytical  and  experimental  results  give  greater  credibility  to  both, 
but  the  ability  to  screen  away  needless  experiments  by  analysis  will  also  improve  the 
overall efficiency of aircraft evaluation and design.


Loss  of  Control  at  High  Angles  of  Attack 

Once aerodynamic descriptions of this flight condition are determined, the
nonlinear and time-varying modeling capabilities of the Northrop methods can be
immediately applied. The multi-axis pilot models required will be similar to the ones
developed for attitude stabilization. As an example, consider Figure 27. This is the
time history of one axis of a two-axis attitude stabilization problem. The system
remained well-behaved for 120 seconds, at which time task interference effects coupled
through the pilot model caused a sudden divergence of the system in .3 seconds.
FUTURE RESEARCH


Apart from continuing work in the areas of the applications discussed above,

...

indicated in Figure 28, which is regarded as the complete pilot model.
FIGURE 28. TOTAL PILOT MODEL

Estimation and observation include both the evaluation of ...
perception position and rate uncertainties and thresholds. Adaptive algorithms to
represent the gain optimizing capability of the pilot are also under investigation.


CONCLUDING  REMARKS 


A comprehensive context for flying qualities prediction and evaluation has been
... in order to achieve sufficient generality of both pilot and aircraft. Based on
...
the central part of this technology is the development
of multi-axis time domain pilot models which include not only the features of
dynamical compensation and human factors data relating to human observation and
estimation, but decision and evaluation capability as well.

In order to validate the decision models for attitude stabilization in turbulence,
analytical and experimental research was completed using fixed and moving base
flight simulation of F-5 and YF-17 aircraft. The pilot-vehicle system model was
adjusted by optimization either before, or independent from, the corresponding
... In this way the predictive capability of the method was demonstrated.

... by the model closely matched the ...

... verified and reproduced analytically.

As a result of this validation study, the importance as well as the practicality
of the basic methodology has been demonstrated. General applications are now
possible and basic research into the decision models for piloted control tasks will
include weapon delivery, loss of control at high angles of attack, approach and
landing, and dynamic pilot-aircraft interface matching of the longitudinal with the
lateral-directional dynamics of the aircraft.


3. 
SUMMARY


The computer's knowledge of how the human has allocated his attention
is posed as an important issue in the design of human-computer systems where
the two decision makers have overlapping responsibilities. It is argued
that it is inappropriate to require the human to continually tell the
computer how he has allocated his attention. Instead, a computer algorithm
employing  fading-memory  system  identification  and  linear  discriminant 
analysis  is  proposed  for  real  time  detection  of  human  shifts  of  attention 
in  a  control  and  monitoring  situation.  Experimental  results  are  presented 
that  validate  the  usefulness  of  the  method. 


INTRODUCTION 


While many processes can be completely automated and thus no longer
require a human as part of the system, there is a large class of processes
where only partial automation is possible and which require that the human
remain part of the system. Thus, a human-computer system is formed. One of the
most important issues in the design of human-computer decision making
systems is the allocation of decision making responsibility [1]. In many
cases, where the human and computer have overlapping abilities, it seems
desirable for them to have overlapping responsibilities [2]. Thus, allocation
of decision making responsibility becomes situation dependent in
that a particular task is assigned to the decision maker who, at the moment,
has the time to allocate to the task in question. The main shortcoming of
this approach is the possibility of conflicts, in the sense that the two
decision makers lose track of what each other is doing, which results in their
competing to perform tasks. Such competition can have disastrous effects
on performance [3].

An intuitive solution to this problem is to have the decision makers
tell each other of their actions. This may be reasonable from the computer's
point  of  view,  but  such  a  dialogue  might  significantly  increase  the  human's 
workload  and  subvert  the  main  reason  for  utilizing  the  computer.  Thus,  we 
need  some  method  for  determining  what  the  human  is  doing  without  continually 
asking  him. 

*Supported  by  the  U.  S.  Air  Force  Systems  Command  under  Contract  F33615-73- 
This paper considers this problem in the context of a control and
monitoring situation. As an example, in an aircraft, the pilot's control task
is maintaining attitude, altitude, heading and speed while his monitoring
task involves maintaining his knowledge of the state of his numerous
subsystems (e.g., electric, hydraulic, etc.). Assuming a computer system was
available for aiding the pilot in detecting and perhaps correcting changes
in subsystem states, then the computer would require a knowledge of the
pilot's perception of subsystem states. Otherwise, they might individually
initiate actions that would jointly be counterproductive or perhaps
disastrous.

We  could  easily  develop  other  examples  for  various  human-vehicle 
systems.  However,  we  will  jump  immediately  to  the  essence  of  the  problem. 
The  basic  question  is:  How  can  the  computer  determine  that  the  human  has 
allocated  to  his  monitoring  task  a  portion  of  his  attention  in  excess  of 
what  is  needed  merely  to  scan  the  monitoring  situation? 

This  paper  proposes  a  method  for  real  time  detection  of  how  the  human 
has  allocated  his  attention.  The  method  utilizes  a  fading-memory  system 
identifier  in  conjunction  with  a  linear  discriminant  function  to  detect 
changes  in  the  dynamics  of  the  control  situation  that  indicate  a  shift  in 
the  human's  allocation  of  attention  in  excess  of  what  is  expected  with 
normal  scanning.  We  will  now  proceed  to  discuss  the  method  in  detail  and 
consider  experimental  results  that  validate  the  usefulness  of  the  method. 

The interested reader can find more detail in Enstrom's thesis [4].


APPROACH 


Considerable research has been devoted to the study of manual control
systems and the identification of the human's dynamics (i.e., transfer
function or state model) in control situations [5,6]. We require a method
that can operate in real time on a digital computer and successfully identify
time-variations in system parameters. A fading-memory least-squares
identification algorithm was chosen.

The  human  plus  controlled  process  were  modeled  as  a  discrete  linear 
system  using 

$$X(k+1) = \Phi X(k) + \Gamma U(k), \tag{1}$$

$$Z(k+1) = H X(k+1), \tag{2}$$

where X is an n state vector, U is a p input vector, $\Phi$ is an n x n state
transition matrix, $\Gamma$ is an n x p disturbance transition matrix, and H is
an m x n matrix. Following Lee [7], these two equations may be transformed
into a canonical form, and then, assuming a single-input, single-output
system, into a difference equation of the following form.

$$Z(k) = \sum_{i=1}^{n} \left[ -a_i Z(k-i) + b_i U(k-i) \right] \tag{3}$$

In matrix notation, this equation becomes

$$Z(k) = S(k-1)'\,\theta \tag{4}$$

where $S(k-1)' = [\,Z(k-n) \;\cdots\; Z(k-1) \;\; U(k-n) \;\cdots\; U(k-1)\,]$ and
$\theta' = [\,-a_n \;\cdots\; -a_1 \;\; b_n \;\cdots\; b_1\,]$. By minimizing the squared model estimation
error, the model parameter vector $\theta$ is iteratively estimated using


$$\hat{\theta}(k+1) = \hat{\theta}(k) + D(k)\left[ Z(k+1) - S(k)'\,\hat{\theta}(k) \right] \tag{5}$$

where

$$D(k) = P(k)\,S(k)\left[ S(k)'\,P(k)\,S(k) + 1 \right]^{-1} \tag{6}$$

$$P(k) = P(k-1) - D(k-1)\,S(k-1)'\,P(k-1) \tag{7}$$

Initially $\hat{\theta}(0) = 0$ while P(0) is usually set equal to an arbitrarily large
diagonal matrix.

In order to assure unbiased estimates, the method requires that $\hat{\theta}$ be
updated only after every n inputs, where n is the order of the difference
equation model. (This avoids biases due to correlated residuals.) A further
modification reflecting this requirement produces the following set of
equations.

$$\hat{\theta}[(k+1)n] = \hat{\theta}(kn) + D(kn)\left[ Z(kn+1) - S(kn)'\,\hat{\theta}(kn) \right], \tag{8}$$

where

$$D(kn) = P(kn)\,S(kn)\left[ S(kn)'\,P(kn)\,S(kn) + 1 \right]^{-1}, \tag{9}$$

$$P(kn) = P[k(n-1)] - D[k(n-1)]\,S[k(n-1)]'\,P[k(n-1)]. \tag{10}$$

The application of these equations will identify single-input,
single-output time invariant systems. However, the method must be modified if
systems with time-varying parameters are to be identified.

A slight modification of derivations by Mendel [8] and Morrison [9]
allows the identification error vector at time j, E(j), to be multiplied
... so that the new squared error term to ...
... The vector W(j) is determined
from the relationship

W(j) = ... W(j - 1) ...

... $\rho$ is always less than or equal to 1.0. By making $\rho$ less than 1.0,
recent errors have a greater influence than old errors in the formulation
... Errors are forgotten as the weighting coefficient
... algorithm equations are

$$\hat{\theta}[(k+1)n] = \hat{\theta}(kn) + D(kn)\left[ Z(kn+1) - S(kn)'\,\hat{\theta}(kn) \right], \tag{12}$$

where

$$D(kn) = P(kn)\,S(kn)\left[ S(kn)'\,P(kn)\,S(kn) + \rho^{2n} \right]^{-1}, \tag{13}$$

$$P(kn) = \frac{1}{\rho^{2n}} \left\{ P[k(n-1)] - D[k(n-1)]\,S[k(n-1)]'\,P[k(n-1)] \right\}. \tag{14}$$

Throughout the remainder of this paper, the term $\rho^{2n}$ will be referred to
as the memory coefficient.
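
As an illustration, the fading-memory recursion can be sketched in Python.
This is a minimal sketch under stated assumptions, not the implementation
used in the study: the function name, the `memory` argument (standing in
for the memory coefficient), the default `p0`, and the choice to update at
every sample rather than only after every n inputs are all assumptions made
here for brevity. With `memory = 1.0` the update reduces to the ordinary
recursive least-squares form.

```python
import numpy as np

def fading_memory_rls(z, u, order=2, memory=0.96, p0=1e4):
    """Track theta in Z(k) = S(k-1)' theta with exponentially weighted RLS.

    Hypothetical helper: `memory` plays the role of the memory coefficient,
    and the update runs at every sample (an assumption of this sketch).
    Returns one row of parameter estimates per sample.
    """
    z = np.asarray(z, dtype=float)
    u = np.asarray(u, dtype=float)
    n = order
    theta = np.zeros(2 * n)          # theta(0) = 0
    P = p0 * np.eye(2 * n)           # P(0): large diagonal matrix
    history = np.zeros((len(z), 2 * n))
    for k in range(n, len(z)):
        # S(k-1)' = [Z(k-n) ... Z(k-1)  U(k-n) ... U(k-1)]
        s = np.concatenate([z[k - n:k], u[k - n:k]])
        # D = P S [S' P S + memory]^-1
        gain = P @ s / (s @ P @ s + memory)
        # correct the estimate with the one-step prediction error
        theta = theta + gain * (z[k] - s @ theta)
        # discount old information by dividing by the memory coefficient
        P = (P - np.outer(gain, s) @ P) / memory
        history[k] = theta
    return history
```

In use, one would pass the recorded tracking output as `z` and the input
path as `u`, then inspect the rows of the returned array for changes in the
parameter estimates over time.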

Given that we can adaptively identify the dynamics of the human plus
process, we need a method of determining when the dynamics
reflect shifts of attention. The method chosen was linear discriminant
analysis [10].

Assume that we have events that fall into two classes and that for
each event j we have m measurements denoted by $y_{ij}$, i = 1, 2, ..., m.
... the class to which a particular event belongs can be determined using

$$c_j = \alpha + \beta_1 y_{1j} + \beta_2 y_{2j} + \cdots + \beta_m y_{mj} \tag{15}$$

where c = 1 denotes membership in class 1 while c = 2 denotes membership
in class 2. ...

... are determined ...


First, the measurements are normalized using

...

(17)

where ... is the number of events which belong to class i. Next, a set of
simultaneous linear equations is formed

...

(18)


or, more compactly,

$$Y \beta = C. \tag{19}$$

The least-squares solution is the familiar

$$\hat{\beta} = (Y'Y)^{-1}\,Y'C. \tag{20}$$


Tatsuoka discusses issues such as whether or not two classes are
significantly different and the determination of the probability of class
membership. The interested reader is referred to reference 10.
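
The least-squares form of the discriminant can be sketched as follows. This
is an illustrative sketch only: the function names, the use of
`numpy.linalg.lstsq` in place of the explicit normal equations, and the
midpoint threshold of 1.5 between the class codes 1 and 2 are assumptions
of this example. The normalization step and the a priori class
probabilities used in the study are omitted.

```python
import numpy as np

def fit_discriminant(Y, c):
    """Least-squares coefficients [alpha, beta_1 .. beta_m] for c ~ alpha + Y beta.

    Y has one row per event and one column per measurement; c holds the
    class code (1 or 2) of each learning-sample event.
    """
    Y = np.asarray(Y, dtype=float)
    A = np.column_stack([np.ones(len(Y)), Y])   # prepend the constant term
    coef, *_ = np.linalg.lstsq(A, np.asarray(c, dtype=float), rcond=None)
    return coef

def classify(coef, Y, threshold=1.5):
    """Assign class 1 or 2 by comparing the discriminant score to a threshold.

    The midpoint threshold is a hypothetical choice, not taken from the paper.
    """
    Y = np.asarray(Y, dtype=float)
    score = coef[0] + Y @ coef[1:]
    return np.where(score < threshold, 1, 2)
```

The coefficients would be fitted once on learning samples with known class
membership and then applied to new measurement vectors as they arrive.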

The above identification/discrimination methodology is proposed for
determining how the human has allocated his attention. The following
discussion of experiments will detail the specifics of the method's
implementation.


EXPERIMENTS 


Three experiments were designed to test the proposed identification/
discrimination method. While the first experiment tested the scheme's
general ability to detect extreme changes in subject tracking dynamics, the
conditions of the second experiment were chosen to represent more realistic
situations and test the method over a range of dynamic systems. The last
experiment was closely controlled to provide data from a group of
well-trained subjects.

An  identical  testing  setup  was  used  in  all  of  the  three  experiments. 

A  one-dimensional  pursuit  tracking  task  was  displayed  in  the  vertical 
direction on a CRT. (See Figure 1.) The CRT was a Hewlett Packard model
1310A with a screen size of 38.1 x 27.9 cm. A PDP-11 was coupled with a
custom built graphics display unit to drive the CRT display. A two axis
displacement joystick from Measurement Systems Inc. (model 521) provided
the means for controller input. Later, when side tasks were added to the
tracking task, the subjects' answers were entered via a keyboard unit from
an Infoton Vistar II terminal. To simplify the operation of the keyboard,
the unneeded keys were covered and the carriage return button, which was
used to signal the completion of data entry, was extended for easy location.
(See Figure 2.) All of the tracking tests were five minutes in length.

The displayed input symbol was a square of size 0.68 cm per side, and
the output symbol was a cross of similar size. (See Figure 3.) The subject
was instructed to move the joystick so as to match the cross's position as
closely as possible to that of the moving square. The input path was
generated by passing Gaussian white noise through a second order filter. A
sampled-data second order filter (At = 0.10 sec), with a gain of 1.0,
natural frequency of 1.0 radian per second, and a damping ratio of 0.5
produced an input path (filter output) with a standard deviation of 447 screen
units. The screen coordinates in the vertical direction were -2047 to
+2047. The same input path was used for all tests and the square's
position was always updated at a rate of ten times per second.
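
A rough sketch of how such an input path could be generated is given below.
Several details are assumptions of this example and are not stated in the
text: the forward-Euler discretization of the second order filter, the
rescaling of the filter output to the reported standard deviation of 447
screen units, the clipping to the screen range, and the function name and
random seed.

```python
import numpy as np

def input_path(n_samples=3000, dt=0.10, gain=1.0, wn=1.0, zeta=0.5,
               target_std=447.0, limit=2047, seed=0):
    """Gaussian white noise passed through a second order low-pass filter.

    3000 samples at dt = 0.10 s correspond to one five minute trial.
    The discretization and the rescaling are assumptions of this sketch.
    """
    rng = np.random.default_rng(seed)
    w = rng.standard_normal(n_samples)
    x1 = 0.0   # filter output
    x2 = 0.0   # rate of change of the filter output
    y = np.empty(n_samples)
    for k in range(n_samples):
        # forward-Euler step of  y'' + 2*zeta*wn*y' + wn^2*y = gain*wn^2*w
        acc = wn ** 2 * (gain * w[k] - x1) - 2.0 * zeta * wn * x2
        x1, x2 = x1 + dt * x2, x2 + dt * acc
        y[k] = x1
    y *= target_std / y.std()          # scale to the reported spread
    return np.clip(y, -limit, limit)   # keep within the screen coordinates
```

Because the seed is fixed, repeated calls return the same path, which
mirrors the use of a single input path for all tests.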

The  joystick  range  was  plus  and  minus  30  degrees  from  the  vertical 
position  with  the  full  range  of  the  joystick  matching  the  full  range  of 
the  screen  coordinates.  The  joystick  gain  was  68.2  screen  units  per  degree, 
that  is  a  one  degree  movement  in  the  joystick  resulted  in  an  input  change 
of  68.2  screen  units.  Two  stable  and  two  unstable  control  dynamics  were 
selected for use in the experiments. (See Table 1.) The joystick was of
negligible mass and was originally operated without spring resistance. The
update frequency for the cross was 10 times per second for experiments one
and two but was changed to 20 times per second for experiment three. Also,
a light resistance spring was added to the joystick and controller gain was
increased from 1.0 to 2.0 for experiment three.

In  the  first  experiment,  one  subject  was  tested  with  two  different 
controller  dynamics.  Both  tests  consisted  of  alternate  periods  of  normal 
tracking  and  periods  of  no  tracking,  where  the  joystick  location  was  kept 
stationary.  In  the  first  test,  the  subject  used  a  type  2  controller  (see 
Table 1) to track for 60 second intervals, with dead times between these
periods  of  10,  20,  and  30  seconds.  The  second  test  was  performed  with  a 
type  1  controller.  After  a  60  second  period  of  continuous  tracking,  the 
remainder  of  the  five  minute  trial  consisted  of  alternate  periods  of  30 
seconds  tracking  and  5  seconds  of  dead  time. 
Figure 3. The tracking situation.




After testing the identification scheme with these extremely large
changes in subject control dynamics, a more realistic side task replaced the
dead times ... A side task requiring the multiplication of two
numbers was chosen as the secondary task. The subject's answer to the
problem was entered ... specially designed keyboard (see Figure 2).
To minimize the physical activity associated with the side tasks, the
... removing eye contact from the CRT screen.

...

Each subject was ... All of the subjects tested were
right handed and each controlled the joystick with his right hand and
operated the keyboard with his left.

The side task was introduced to the subject by displaying it on the
... screen. (See Figure 4.) When the side task appeared, the subject
was instructed to compute the answer and enter it on the keyboard
at a self-paced rate that would eliminate mistakes. The multiplication
... until the return key was hit, signalling completion
of the task. To prevent unreasonably long reaction times or neglect
of the main tracking task, subjects were scored on both RMS tracking error
and the average ... the real side tasks. Peer pressure
due to the ... appeared to serve as ample subject motivation.
... the side tasks, each multiplication included
a two digit number and a one digit number, forming a three digit product
less than 250.

The scanning required to detect and interpret a side task would require
... attention shift before mathematical calculations were begun. To
determine if the scanning effect resulted in significant changes in
..., "artificial" side tasks were also used. An artificial
side task would require visual scanning, but would not necessitate mental
... after the text was recognized as a false command. These tasks
... after being displayed for five seconds. Three
actual side tasks were included in each five minute test - always at times
... subjects of the third experiment were only
exposed to two side task runs, it is unlikely that the timing of the tasks
would have ... subjects. Artificial side tasks
were ... distributed throughout the five minute tracking period, with
one restriction: ... tasks would always follow real side tasks
... guarantee completion of the real task
... of an artificial one. See Table 2 for a summary
of the side task times, types and screen locations.

In the second experiment, two subjects were tested with three sets of
control dynamics. For each dynamic type, both subjects were run with and
without side tasks. ... familiarize
the subject with the system dynamics and allow him to devise a satisfactory
operating ... fifteen minute period, a second test was
... control dynamics and the predetermined side tasks. Side
tasks with different arithmetic problems were used on successive tests.




TABLE 2

SIDE TASK INFORMATION

                              Position
 Time     Number    Type     X       Y

 4.720      15       K      2200    1500
 4.444      14       K       700    1800
 4.000      13       X      3000     300
 3.860      12       K      3100    3500
 3.750      11       K      1200     700
 3.470      10       K      2300    2500
 3.000       9       X      1000    2200
 2.720       8       K       100    1800
 2.500       7       K      3000    1500
 2.000       6       X      2100    2800
 1.720       5       K      1000    1500
 1.250       4       K      3000    2000
 0.805       3       K        50      50
 ...         2       K       500    3000
 0.278       1       K      2200     500
Four subjects were carefully trained on control type 4 for the last
experiment. Four practice trials of straight tracking were spaced within
a three day period. After the fourth trial, the subjects were tested again
with side tasks. The initial reactions to the side tasks were more drastic
than anticipated. Several side task errors occurred, while one subject lost
control of the output cross, causing the coordinates of the cross to exceed
the  boundaries  of  the  CRT  display  area.  For  this  reason,  these  side  task 
trials  were  duplicated  on  the  following  day  with  different  arithmetic 
problems.  Subject  number  three  was  run  a  third  time,  due  to  an  error 
resulting  in  loss  of  tracking  data. 


RESULTS 


The  raw  data  collected  from  each  experimental  trial  consisted  of  the 
input  and  output  signals  for  the  tracking  tasks  and,  when  side  tasks  were 
included  in  the  trial,  start  and  completion  times  for  each  task. 

Results  of  the  first  experiment  proved  that  the  identification/ 
discrimination  method  was  capable  of  adapting  to  unrealistically  large 
changes  in  the  control  system.  The  two  data  sets  were  analyzed  using  a 
second  order  difference  equation  model  and  a  range  of  memory  coefficients. 
Substantial changes occurred in the model's four parameters directly
corresponding to the dead periods of joystick control. An identification memory
coefficient  of  0.96  produced  model  parameter  changes  that  were  easily 
detected  by  visual  inspection  of  time  plots  of  the  parameter  values.  Since 
the  conditions  of  this  experiment  were  oversimplified,  further  analysis 
was  not  pursued. 

The  remaining  discussion  of  side  task  detection  will  be  divided  into 
two  areas.  First,  the  parameters  of  the  identification  scheme  (i.e.,  model 
order  and  memory  coefficient)  will  be  discussed.  Then,  the  application 
of  the  linear  discriminant  function  will  be  considered.  The  tracking  and 
side task data from experiment two is the basis of these discussions.
Finally,  we  will  discuss  the  results  when  the  data  from  the  third  experiment 
was  processed  according  to  the  findings  from  experiment  two. 

The  difference  equation  model  order,  which  was  arbitrarily  set  to 
two  for  experiment  one,  was  studied  by  forming  control  models  of  orders 
one  thru  five  for  the  six  non-side  task  tracking  data  sets  from  experiment 
two.  The  outputs  from  these  models  were  compared  to  the  real  output  values 
to  determine  an  RMS  fitting  error.  The  identification  model  memory 
coefficient  was  set  to  1.0  for  this  analysis  since  side  tasks  were  not 
used,  producing  fairly  constant  system  parameters.  As  expected,  the 
fitting  error  decreased  with  increasing  model  order.  Trade-offs  between 
accuracy  and  feasibility  for  use  in  real  time  applications  resulted  in  a 
compromised  selection  of  a  second  order  model. 
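
The model-order comparison can be illustrated with a short sketch. It is
hedged in two ways: it uses a batch least-squares fit (which corresponds to
a memory coefficient of 1.0) rather than the recursive algorithm, and it
scores the one-step-ahead prediction error, whereas the text does not say
exactly how the model outputs were generated. The function name is
hypothetical.

```python
import numpy as np

def rms_fit_error(z, u, order):
    """RMS one-step-ahead error of a least-squares difference equation model.

    Each row of S is [Z(k-order) ... Z(k-1)  U(k-order) ... U(k-1)], and the
    model predicts Z(k) as S(k-1)' theta.
    """
    z = np.asarray(z, dtype=float)
    u = np.asarray(u, dtype=float)
    S = np.array([np.concatenate([z[k - order:k], u[k - order:k]])
                  for k in range(order, len(z))])
    target = z[order:]
    theta, *_ = np.linalg.lstsq(S, target, rcond=None)
    return float(np.sqrt(np.mean((target - S @ theta) ** 2)))
```

Calling the function for orders one through five on the same recorded data
gives the kind of error-versus-order comparison described above.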

To adaptively identify time-varying system parameters, a memory
coefficient less than 1.0 was required. As the coefficient was decreased,
changes in the system would affect this identification scheme more quickly.
However, noise and inaccuracies would also increase with smaller memory
coefficients. Thus, a tradeoff was necessary between identification
response time and the accuracy of the results. To determine a suitable
memory coefficient, the RMS fitting error was determined for side task
data runs from experiment two with memory coefficients of 1.00, 0.95, 0.90,
0.80 and 0.70. The fitting error decreased as the memory coefficient
was reduced and then started to increase with a coefficient of 0.70. A
value of 0.90 was selected since memory coefficients less than 0.90 produced
erratic parameter values - even for non-side-task tracking periods.

Now that a second order model and a memory coefficient of 0.90 have
been established, the use of discriminant analysis to detect control system
changes will be discussed. A reasonable first choice of discriminant
measures was the four parameters of the model (i.e., the four coefficients
of the second-order difference equation). If changes in these parameters
were linearly related to side task attention shifts, the discriminant
function would be able to detect the side task periods. Visual examination
of the parameter vs. time plots indicated that absolute changes in the
parameter values did not always correlate well with the side task times.

A measure representing the relative changes of the parameters was proposed
to correspond more consistently to the side task periods. A long term
average of each parameter was calculated and, by comparing the current
parameter value to the long term reference value, relative changes were
measured. By summing the relative changes over short periods of time,
four more discriminant measures were formed. A final measure was added
to the group of eight properties by forming an average of the squared
tracking errors.
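
One possible reading of the relative-change measures is sketched below. The
text does not give the exact formula, so the definition used here (absolute
deviation from a trailing average, divided by the magnitude of that
average, then summed over a shorter trailing window) is an assumption, as
are the function names and the default windows of 250 and 60 samples
(25 and 6 seconds at ten samples per second).

```python
import numpy as np

def trailing_sum(x, window):
    """Sum and count of the last `window` rows of x (fewer near the start)."""
    csum = np.cumsum(np.vstack([np.zeros((1, x.shape[1])), x]), axis=0)
    end = np.arange(1, len(x) + 1)
    start = np.maximum(end - window, 0)
    return csum[end] - csum[start], (end - start)[:, None]

def relative_change_measures(params, ref_window=250, rel_window=60, eps=1e-9):
    """Summed relative change of each identified parameter.

    params has one row per sample and one column per model parameter.
    """
    params = np.asarray(params, dtype=float)
    total, count = trailing_sum(params, ref_window)
    reference = total / count                       # long term average
    rel = np.abs(params - reference) / (np.abs(reference) + eps)
    summed, _ = trailing_sum(rel, rel_window)       # short-window sum
    return summed
```

Applied to the four identified parameters, this yields four additional
columns that could be stacked with the parameters and the averaged squared
tracking error to form the nine discriminant measures.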

Now we will consider the effects of some of the variables associated
with the detection scheme. Such variables include the window size used
in the formation of the long term parameter reference values, the relative
measure window size, and the a priori probability associated with each
classification type. Then, we will consider the scheme's ability to use
a single set of sample class measurements to analyze trial data runs within
subjects and across dynamics - and within dynamics and across subjects.

The scheme's performance with fewer discriminant measures is also of
interest.

We  have  not  attempted  to  determine  the  optimal  settings  for  the 
identification/discrimination  scheme.  Instead,  our  efforts  were  aimed  at 
showing  some  overall  trends  and  characteristics  associated  with  the  side 
task detection algorithm. In many cases when the effect of a single
variable was analyzed, the effects of all other variables were assumed
to be independent.

Long term reference window sizes of 10.0, 20.0, 25.0, and 30.0 seconds
were considered. Fixing the probability of a side task event at 0.05 and
a relative measure window size of 6.0 seconds, we found that a reference
window size of 25.0 seconds was the only value that detected every side
task, and resulted only in false alarms that could be related to artificial
side tasks.

Relative window sizes of 4.0, 6.0, 8.0, 12.0 and 20.0 seconds were
considered. Although the results did not show any one window size to be
consistently superior to the others, a window size of 8.0 seconds normally
produced side task detections with a greater degree of certainty than the
others.

The discriminant function requires an initial probability guess for the
likelihood of an event being from one of the two classes. As the
probability estimate of an event's occurrence is decreased, the discriminant
function will predict a smaller number of events for that particular class.
Data from experiment two was analyzed using probabilities of a side task
event equal to 0.50, 0.10, 0.05, and 0.01. We found that a probability
of 0.05 resulted only in false side task detections that were possibly
related to an artificial side task, and the one real side task detection
that was missed was also missed with values of 0.10 and 0.01.

Until  this  point,  the  discriminant  function  learning  samples  were 
formed  from  the  same  data  file  that  was  to  be  analyzed.  Possibly  the 
similarity  among  subjects  and/or  dynamics  was  great  enough  to  permit  the 
use  of  one  set  of  discriminant  function  coefficients  in  the  analysis  of 
different  experimental  trials.  Because  experiment  two  used  only  two 
subjects,  but  three  different  controller  dynamics,  this  data  will  be 
analyzed  across  the  dynamics,  but  within  each  subject.  Individual  learning 
samples  were  summed  together  making  one  large  learning  file.  The  results 
of  these  tests  are  shown  in  Table  3.  Overall,  the  detection  algorithm 
did not work very well across the different dynamics used in this
investigation. However, note that in both subject sets the controller type 2
run  was  identified  far  better  than  the  rest.  Possibly  the  discriminant 
function  for  this  control  system  was  near  the  average  of  all  three. 

Nine measures were used with the discriminant function to take full
advantage of as many measures as seemed reasonable. The number of
measurements required to make detections was considered by analyzing the
same sets of data with a decreasing number of measures. In each reduction,
the poorest remaining indicator was dropped. The average order of
importance of the measures from experiment two was as follows:


Order of
Importance    Description

    1         Relative Change of Measure No.
    2         Relative Change of Measure No.
    3         Input Value 0.2 Seconds Ago
    4         Relative Change of Measure No.
    5         Relative Change of Measure No.
    6         Output Value 0.2 Seconds Ago
    7         Input Value 0.1 Seconds Ago
    8         Output Value 0.1 Seconds Ago
    9         Tracking Error Squared
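The reduction procedure (drop the poorest remaining indicator, then repeat) can be sketched as a backward elimination. The ranking criterion used here, the magnitude of the standardized discriminant weight, is an assumption; the text does not state how the "poorest" measure was judged. The function names and the synthetic data are likewise hypothetical.

```python
import numpy as np

def standardized_weights(x0, x1):
    """Discriminant weights scaled by each measure's pooled standard deviation."""
    m0, m1 = x0.mean(axis=0), x1.mean(axis=0)
    n0, n1 = len(x0), len(x1)
    pooled = ((n0 - 1) * np.cov(x0, rowvar=False)
              + (n1 - 1) * np.cov(x1, rowvar=False)) / (n0 + n1 - 2)
    pooled = np.atleast_2d(pooled)
    w = np.linalg.solve(pooled, m1 - m0)
    return w * np.sqrt(np.diag(pooled))

def elimination_order(x0, x1):
    """Return measure indices ordered from most to least important."""
    remaining = list(range(x0.shape[1]))
    dropped = []
    while len(remaining) > 1:
        w = standardized_weights(x0[:, remaining], x1[:, remaining])
        worst = remaining[int(np.argmin(np.abs(w)))]  # assumed "poorest" measure
        remaining.remove(worst)
        dropped.append(worst)
    return remaining + dropped[::-1]

# Synthetic example: measure 0 separates the classes well, measure 2 not at all.
rng = np.random.default_rng(0)
class0 = rng.normal(0.0, 1.0, size=(2000, 3))
class1 = rng.normal(0.0, 1.0, size=(2000, 3)) + np.array([2.0, 0.5, 0.0])
order = elimination_order(class0, class1)
```

With this synthetic data the resulting `order` places the strongly separating measure first and the uninformative one last.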




TABLE 3

RESULTS OF USING SAME DISCRIMINANT FUNCTION
ACROSS DYNAMICS, WITHIN SUBJECTS

                      Number of 2 Second      Number of 2 Second
                      Intervals Detected      Intervals Falsely
Subject  Controller   During Side Task No.    Detected
Number   Type           1     2     3         Total   R.A.S.T.*

1 

1 

1 

1 

2 

3 

i 

3 

6 

0 

0 

0 

1 

0 

2 

0 

6 

1 

3 

0 

0 

0 

0 

0 

0 

0 

0 

2 

2 

2 

1

2 

3 

6 

0 

8 

0 

4 

0 

0 

0 

0 

0 

* Number related to artificial side task


TABLE 4

REDUCING THE NUMBER OF DISCRIMINANT FUNCTION PROPERTIES

Number of properties:   9  8  7  6  5  4  3  2  1

6  0  0  0  0  0  0  0  0
2  0  0  0  0  0
0  0  0  0  0  0
4  1  1  1  0  0  0  0  0
8  4  4  4  4  2  1  0  0
8  8  8  8  8  8  7  6  6
3  3  2  2  2  2  0  0  0
7  6  6  6  6  6  2  2
5  5  5  5  4  3  1  1  0
0  0  0  1  1  0
5  4  4  4  3  3
Three important points should be noted about the order of importance of
the property list. First, the relative change [...] was a better indicator
than the absolute [...]; of the first five important properties, [four
are] relative measures. Secondly, the error squared term, initially
thought to be a good indicator, was the poorest. This is not surprising
after reviewing the circumstances: the error is not only a function of the
subject's output position, but also of the input position, over which the
subject has no control. The final point is that the most important variable
was the relative change of [...] related to the input value of 0.2
seconds ago. Since the reaction time delay for a human controller
is in this range, we might conjecture that the subject's time delay is
affected during [...]. The results of reducing the number
of discriminant function [properties are shown] in Table 4. Of the six
data sets tested, four were successfully processed using only five
[...] sets contained side tasks that
were not detected with the total nine properties.

[...] data from the third [experiment ...] parameters (i.e., no [...])
[deter]mined from the data of the second experiment. Summarizing these
parameters, the final side task detection algorithm identified four
parameters of a second order difference equation model [using a]
fading-memory identification method with a memory coefficient [...]. [The]
four model parameters plus [four] relative [changes] plus a tracking error
squared [term were used in a] discriminant function which was "trained"
[with] sample data from each of the two classifications.
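A fading-memory identifier of the kind named here is commonly implemented as recursive least squares with exponential forgetting. The sketch below is one such implementation under stated assumptions, not the authors' code: the model form y[k] = a1·y[k-1] + a2·y[k-2] + b1·u[k-1] + b2·u[k-2], the memory coefficient `lam = 0.98`, the initial covariance scale `delta`, and the synthetic data are all hypothetical, since the original values are not legible in this copy.

```python
import numpy as np

def fading_memory_rls(u, y, lam=0.98, delta=1000.0):
    """Track four parameters of a second order difference equation model.

    u, y : input and output samples; lam : memory (forgetting) coefficient.
    Returns one parameter estimate [a1, a2, b1, b2] per time step.
    """
    theta = np.zeros(4)
    P = delta * np.eye(4)                  # large initial uncertainty
    history = np.zeros((len(y), 4))
    for k in range(2, len(y)):
        phi = np.array([y[k - 1], y[k - 2], u[k - 1], u[k - 2]])
        gain = P @ phi / (lam + phi @ P @ phi)
        theta = theta + gain * (y[k] - phi @ theta)   # correct by prediction error
        P = (P - np.outer(gain, phi) @ P) / lam       # discount old data
        history[k] = theta
    return history

# Synthetic, noise-free check with hypothetical "true" parameters.
rng = np.random.default_rng(0)
true_theta = np.array([1.2, -0.5, 0.3, 0.1])
u = rng.normal(size=600)
y = np.zeros(600)
for k in range(2, 600):
    y[k] = true_theta @ np.array([y[k - 1], y[k - 2], u[k - 1], u[k - 2]])
final_estimate = fading_memory_rls(u, y)[-1]
```

A memory coefficient below 1 weights recent samples more heavily, so the estimates can follow changes in the operator's behavior; relative changes of these estimates could then be formed and passed to the discriminant function.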

The general results of experiment [three ...] that every real [side task
...] were possibly related [to] artificial side tasks. Thus, the results of
[...] applicable to at least some additional control
systems with different subjects and different controller types.

The data from [experiment three were also] processed across subjects within
[...]. These results appear in Table 6 [...]
for different subjects using [the same] dynamics. Notice that only one real
side task was not detected [and the] false detections were again
related to artificial [side] tasks. This is somewhat surprising because the
four subjects did not have similar tracking skills: one was much better,
while one was consistently poorer.

Finally, the algorithm's [...].
Using a DEC-10 computer, [...] variables and read in the data from a five
minute trial. The [...] measurements (the squared error term was [...])
[...] about [...] seconds [were required] to process a total of five
TABLE 5

GENERAL RESULTS OF EXPERIMENT 3

                      Number of 2 Second      Number of 2 Second
                      Intervals Detected      Intervals Falsely
Subject  Controller   During Side Task No.    Detected
Number   Type           1     2     3         Total   R.A.S.T.*

A 

6 

5 

6 

0 

0 

j 

4 

5 

6 

■4 

4 

4 

4 

6 

6 

5 

4 

4 

11 

10 

3 

8 

2 

2 

0 

2 

2 

0 

* Number related to artificial side task


TABLE 6

RESULTS OF USING SAME DISCRIMINANT FUNCTION
FOR ALL SUBJECTS IN EXPERIMENT THREE

                      Number of 2 Second      Number of 2 Second
                      Intervals Detected      Intervals Falsely
Subject  Controller   During Side Task No.    Detected
Number   Type           1     2     3         Total   R.A.S.T.*

   3        4           6     3     5           3        3
   4        4           4     0     9           0        0
   5        4           6     4     2           3        3
   6        4           5     7     5           0        0

* Number related to artificial side task
minutes of tracking data. Assuming a PDP-11 to be five to ten times slower
than a DEC-10, a PDP-11 or other similar sized computer could also be used
for real time application of the detection algorithm.


CONCLUSIONS 


If a human and computer are to successfully interact in situations
where they have overlapping responsibilities, they need to know what each
is doing. However, it is unreasonable to require the human to
continually inform the computer of his actions. A method is needed for the
computer to detect what the human is doing.

This paper has considered human-computer interaction in control and
monitoring situations and proposed a method utilizing a fading-memory
system identifier and linear discriminant analysis that allows real time
detection of how the human has allocated his attention between the control
and monitoring tasks. Experimental results have validated the usefulness
of the method.

To actually implement the method in, for example, an aircraft or
spacecraft, further research is needed to develop heuristics for handling
problems such as periods when the human completely stops tracking. Multi-class
discriminant analysis and perhaps some nonlinear method might be appropriate
to this problem.

The  proposed  method  may  also  be  useful  as  a  research  tool  for  those 
investigating  motor  skills  or  attention  allocation.  Such  research  would  be 
of  use  in  interpreting  the  outputs  of  multi-class  discriminant  analyses. 

To conclude, we have posed what is a very real problem in the design
of [...] computer systems. Our solution, in the context of control and
[monitoring] situations, appears feasible in a real time setting, but
requires more development work before actual implementation.
ABSTRACT


Advancing computer technology has facilitated the implementation of
on-line adaptive man-machine systems. In these systems, computer decisions
based on information concerning the state of the operator can affect the
nature of the man-machine interaction. Some limitations of performance
measures as sources to provide information to the computer are presented,
and it is argued that these limitations are particularly restrictive when
the information concerns selective attention in a monitoring task, or the
assessment of residual attention. It is suggested that psychophysiological
measures such as the event related cortical potential (ERP) may be utilized
to bypass these limitations. The characteristics of the ERP and experimental
demonstrations of its sensitivity to attentional manipulations are described
and a program of relevant research is then outlined. This includes a
description of a pilot investigation of the relationship between the ERP to
auditory probe stimuli and the workload demands of a two dimensional
tracking task.


PSYCHOPHYSIOLOGICAL  MEASURES  IN  THE  ADAPTIVE  MAN-MACHINE  SYSTEM 


The remarkable developments in mini- and micro-computers are
transforming the design of man-machine systems. The computer industry is
producing smaller, faster and more economical computers. It is, therefore,
increasingly easy to incorporate computers as control components in
man-machine systems, with striking effects on the flexibility of the
systems. Most notable is the increasing prevalence of adaptive systems. In
these, the computer can affect the nature of the man-machine interaction by
[applying], dynamically, an optimizing algorithm. System behavior can be
adjusted to the continually changing states of the operator, the
environment and to the interaction between the two. This flexibility
depends on an exchange of information between the operator and the machine.
Within the framework of a computer based system it is possible to conceive
of novel channels of communication to supplement conventional display and
control interfaces.

This  report  describes  a  program,  which  is  one  of  several  Biocybernetic 
programs  supported  by  ARPA's  Cybernetic  Technology  Office,  which  seeks  to 
develop  such  a  communication  channel  utilizing  information  which  can  be 
derived  from  brain-waves. 

The environment for which we developed these channels is exemplified
by the hypothetical high performance aircraft with on-board computer
facilities represented in Figure 1. The conventional Display, Pilot Control
and Plant components are supplemented by various performance aids which can
be implemented or adjusted on-line. These may consist of the addition or
removal of predictive display information, an alteration of the control or
Plant dynamics along various axes or perhaps the assumption of control of
certain tasks normally performed by the operator. In order that these
adaptive decisions be made intelligently by the on-board aiding center, a
managing computer or Decision Center shown in Figure 1 must be well informed.
The information it requires obviously includes characteristics of the
mission, the status of other aircraft, vehicular disturbances and ground
controlled inputs. Additionally, it would be extremely useful to the
Decision Center to have estimates of two important aspects of the operator's
attentional state: what information he is processing or ignoring at any
time (selective attention), and his overall level of workload or involvement
with on-going tasks. Operator workload in this sense is often conceptualized
as a variable that is reciprocally related to the amount of residual
attention available to deal with unexpected environmental contingencies (1).

Various performance measures are traditionally used to index attention
and workload. These have been found to provide adequate indices of both
aspects of the attentional state. For example, in a multi-display selective
attention task, the allocation of attention between sources of discrete
stimuli has been revealed by response latency (2, 3) or accuracy.
Attention allocation to continuous tasks has been successfully identified
through changes in tracking gain (2, 4), information transmitted (4), time
delay (5), holds in the tracking output (6, 7), remnant or observation noise
(8, 9), or by more complex coefficients of a linear discrete time series
model (10).

Operator workload or residual attention has often been measured by the
"secondary task loading" technique (11). The operator is presumed to possess
some limited quantity of processing resources which can be distributed among
various tasks. As a primary task is made more difficult (its workload demand
increased), a greater quantity of the limited resource is required to maintain
criterion performance, and conversely a lesser amount remains to perform the
secondary task. As a consequence, secondary task performance decreases,
serving as an index of primary task difficulty. Such techniques have proven
reliable in comparing different display or control configurations (12), or
in validating subjective estimates of control task difficulty (13).

While performance measures do serve adequately as indices of selective
attention, their use is necessarily restricted to tasks in which overt
responses are produced. Thus they are inappropriate for a class of tasks
that are becoming critically important in modern system control: passive
monitoring during which few overt responses are emitted by the operator.

To assess the allocation of attention during a monitoring task, periodic
probe events must be inserted in the environment and an overt (detection)
response to the probes required. Such probes are of necessity disruptive
to ongoing performance. The difficulties encountered by an on-line
assessment of workload through secondary task loading are more severe. The
secondary task performance may well disrupt performance on the primary
task with possibly critical consequences. It may also saturate the
residual attention that it was designed to assess.

Even in situations where the above restrictions are not encountered
(for example divided attention between two information transmission tasks),
a further limitation upon the usefulness of performance measures is presented
by the inherent response variability which precludes reliable estimates of
attention-sensitive parameters from a single observation. Thus assessment
of any or all of the performance measures described above must involve a
number of behavioral observations taken over time, a requirement which
further limits the usefulness of these measures in closing an on-line
adaptive loop such as that shown in Figure 1.

The shortcomings of performance measures point to an urgent need for
additional sources of information which can either supplement or, as in the
monitoring and workload-measurement situations described above, replace the
performance measures in providing information to the Decision Center. It
is for this reason that we have initiated an investigation to determine if
psychophysiological measures can serve as indices of human information
processing characteristics. Psychophysiological measures have two important
properties that counteract the drawbacks of performance variables pointed
out above. (These advantages are, of course, purchased at the cost of
increased complexity in measurement. The cost effectiveness of these
procedures is a matter for future research and will not be discussed here.)
It is reasonable to assume that the inherent random noise sources which
perturb or contribute to the variability of psychophysiological measures
are relatively uncorrelated with the sources of performance variability.
Assuming an independence of the noise sources, the information provided
jointly by both signals in any given observation period should be of
greater reliability than the information provided by either source alone.
In other words the time required for a Decision Center to obtain an estimate
of the subject's internal state at a given level of reliability will be
reduced when estimates are based upon joint measures.
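The reliability argument can be made concrete with a small worked example. It assumes, beyond what the text states, that each source gives an unbiased estimate with a known noise variance and that the two are combined by inverse-variance weighting; the variances and the target below are hypothetical numbers chosen only for illustration.

```python
import math

def joint_variance(var_a, var_b):
    """Variance of the inverse-variance weighted mean of two independent estimates."""
    return 1.0 / (1.0 / var_a + 1.0 / var_b)

def observations_needed(var_single, var_target):
    """Independent observations to average before the variance reaches the target."""
    return math.ceil(var_single / var_target)

# Hypothetical per-observation noise variances for the two sources.
var_performance = 4.0
var_physiological = 4.0
var_joint = joint_variance(var_performance, var_physiological)

# Observation periods required to reach a target variance of 0.5.
n_performance_only = observations_needed(var_performance, 0.5)
n_joint = observations_needed(var_joint, 0.5)
```

Under these assumptions the joint estimate reaches the same reliability in half as many observation periods (4 rather than 8); the gain shrinks if the noise sources are in fact correlated.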

The second advantageous feature of psychophysiological measures is
that, with proper instrumentation, they may be assessed in a manner that
is less disruptive to ongoing performance than the monitoring probes or
the secondary tasks discussed above. One example of the potential
usefulness of such measures has been provided by another Biocybernetics
project in which Beatty (14) has demonstrated the utility of pupil diameter
as a measure of the operator's cognitive processing load. Our own efforts
focus on the scalp recorded event-related-potential (ERP) as a source which
may provide useful information concerning the operator's cognitive state.
THE EVENT-RELATED-POTENTIAL


The ERP is a transient [...] change in the brain which is elicited by
any discrete event, [and can be recorded] by surface electrodes attached
to the scalp. Superimposed on the ongoing EEG, the ERP extends for at
[...] positive-going [and negative-going] peaks with characteristic
latencies following the stimulus. [...] The amplitude of the different
peaks appears to be sensitive to [...] [poten]tial value in providing
considerable information to an on-board computer.

[...] multitrial averaging techniques can [be used] to cancel [the] noise
contribution and provide an estimate [...]. [For] on-line assessment of
behavior, a wide variety of techniques are being developed and implemented
as a part of this project which will enable successful identification and
classification of the characteristics of an ERP [from] a single [...]
adjustment to reveal peaks (19) or application of linear discriminant
analysis (20).
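Why averaging over trials cancels the noise contribution can be shown with a short simulation. The waveform shape, its amplitude, the noise level, and the trial counts below are hypothetical; the sketch only assumes that the background activity is zero-mean and independent from trial to trial, in which case the residual noise falls roughly as one over the square root of the number of trials.

```python
import numpy as np

rng = np.random.default_rng(1)
t = np.linspace(0.0, 0.5, 251)                    # seconds after the stimulus
erp = 5.0 * np.exp(-((t - 0.3) / 0.05) ** 2)      # hypothetical single peak

def residual_rms(n_trials, noise_sd=20.0):
    """RMS difference between the n-trial average and the true waveform."""
    noise = rng.normal(0.0, noise_sd, size=(n_trials, t.size))
    average = (erp + noise).mean(axis=0)
    return float(np.sqrt(np.mean((average - erp) ** 2)))

rms_single = residual_rms(1)       # one trial: noise dominates the waveform
rms_hundred = residual_rms(100)    # 100 trials: noise reduced about tenfold
```

The cost of this noise reduction is time: the averaged estimate is available only after many probe events, which is the motivation for single-trial methods in an on-line setting.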

Although the success of ERPs as indicants of attention in a complex
[...] two element visual [...]. This finding has been replicated in the
auditory modality (22, 23). [...] components of the waveform have also
been found to be [...]. Clearly then [ERPs] reflect all-or-none discrete
shifts in attention as defined by the relevance or non-relevance of an
information source.

There is much less validation in the literature of the ability of the
ERP measures to distinguish between levels of workload or attention on a
continuous or graded basis in a manner which has been established with
[... 2 5)]. In a study in which ERPs were recorded to [...] in a two
channel signal detection task, Lafayette, Dinand and Gentil (26) were able
to observe changes in the early positive and negative [components of the]
ERP as the stimuli were made more or less relevant by [...] secondary
cognitive reasoning task was manipulated.
PRELIMINARY EXPERIMENTATION


We are currently investigating the capability of ERP measures to
reflect the subject's information processing characteristics in an
environment that simulates more closely the control and monitoring [tasks
performed] by the pilot. Our basic experimental approach is to record ERPs
to probe auditory or visual stimuli. These probes are either irrelevant to
the tasks performed (and may therefore be ignored by the subjects), or
require only a minimum amount of cognitive processing, thus avoiding any
disruption of primary task performance. In a selective attention/monitoring
paradigm, the probe stimulus may occur along one of the relevant channels,
displays or information sources to determine if the elicited ERP will
reflect the extent to which that source is being processed. The stimulus
attribute of the probe will however be different from the attributes of the
channel that is relevant to the monitoring task. That is, if the subject is
monitoring a visual channel for a spatially defined event, the probe will
involve an intensity change. Alternatively, in a divided attention
processing task, the ERP-eliciting stimuli can be the same stimuli as those
that are processed and responded to in the performance of the tasks.
Finally, in a workload manipulation paradigm, the probes are presented along
channels that are totally irrelevant to the primary task performed, in order
to determine if the elicited ERPs will reflect the residual attention
available from that task as its workload is manipulated.

We have investigated ERPs and residual attention in a pilot experiment
in which six subjects performed a two-dimensional pursuit tracking task with
dynamics of the form

K1  +  K2S 


on both axes. Twelve 3-minute practice trials were first presented to
bring the subjects to a stable level of performance via adaptive techniques,
with forcing function cutoff frequency employed as the adaptive variable.

ERPs were next recorded in two workload manipulation sessions. The
probes consisted of a Bernoulli series of rare and frequent tones differing
in pitch. The two sessions differed from each other according to whether
[the] stimuli were ignored or processed. "Processing" involved
maintaining an internal count of the number of rare stimuli that occurred
during a trial (see Figure 2).
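A Bernoulli series of probe tones, together with the count a subject would maintain in the "processing" session, can be sketched as follows. The probability of a rare tone, the number of tones per trial, and the function name are hypothetical; the text does not give these values.

```python
import random

def bernoulli_tone_series(n_tones, p_rare, seed=None):
    """Draw each tone independently: 'rare' with probability p_rare, else 'frequent'."""
    rng = random.Random(seed)
    return ["rare" if rng.random() < p_rare else "frequent"
            for _ in range(n_tones)]

# One hypothetical trial of 200 probes with a 20% chance of a rare tone.
series = bernoulli_tone_series(200, 0.2, seed=3)

# The running count the subject is asked to keep when processing the probes.
rare_count = series.count("rare")
```

Because each tone is drawn independently, the subject cannot predict the next tone from the preceding ones, and the reported count can be compared with `rare_count` to check that the probes were in fact processed.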

Within each session, workload was manipulated by two different methods
(Figure 3). First, the forcing function bandwidth was increased and
decreased by 30% from the asymptotic level achieved by each subject in the
adaptive practice session. This generated 3 different levels of
task difficulty. Then, based upon the performance of each subject in the
intermediate and high bandwidth conditions (moderate and large error
respectively), two target diameters were selected, equal in value to the
RMS error measured for each subject in those two conditions. A third
larger diameter was also selected of proportionately greater size. In this
manner the manipulation of target diameter, or required error tolerance, was
"calibrated" for each subject according to his sensitivity to the bandwidth
manipulations, the object of this calibration being to obtain equivalent
manipulations of subjective performance demands across all subjects.
Tracking under the three target sizes was performed at the constant
intermediate bandwidth level. It was therefore assumed that progressively
more residual attention would be made available from the tracking task as
the error tolerance (and obtained error) was increased.
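The per-subject calibration can be written out as a short calculation. The sketch reads "proportionately greater" as scaling the largest measured error by the same ratio that separates the two measured errors; that reading, the function name, and the numbers in the example are assumptions made for illustration.

```python
import math

def workload_conditions(asymptotic_bandwidth, error_mid, error_high):
    """Return the three bandwidths and three target diameters for one subject."""
    bandwidths = [0.7 * asymptotic_bandwidth,      # 30% below asymptotic level
                  asymptotic_bandwidth,            # intermediate level
                  1.3 * asymptotic_bandwidth]      # 30% above asymptotic level
    ratio = error_high / error_mid                 # assumed proportional step
    diameters = [error_mid, error_high, error_high * ratio]
    return bandwidths, diameters

# Hypothetical subject: asymptotic bandwidth 1.0, measured errors 2.0 and 3.0.
bandwidths, diameters = workload_conditions(1.0, 2.0, 3.0)
```

Tying the diameters to each subject's own measured error is what makes the tolerance manipulation comparable across subjects of different tracking skill.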

The two particular workload manipulations employed may be placed in
context by assuming that the attentional demand or subjective difficulty
of a task is a joint function of its objective difficulty (e.g., task
characteristics such as input bandwidth or dynamics), and the performance
level required (in the present case, specified by the target diameter).
What we have done then is to manipulate each of these dimensions of
difficulty separately, while holding the other constant.

The logic of the tracking task, probe signal presentation and ERP
recording was controlled by a PDP 11/40 computer (27). ERPs recorded from
3 electrode sites were amplified and were displayed on line
via a GT-44 graphics display terminal. The data were also recorded on
digital tape for later off-line analysis and plotting by a Harris computer.

The preliminary data thus far collected have indicated that stable ERP
waveforms can in fact be elicited by probes under the high levels of primary
task workload required in the experiment. At this writing the data are
being analyzed and it is already clear that there is a substantial degree
of individual differences in the shape and temporal characteristics of the
waveforms. These may well be related to different strategies that subjects
adopt in performing their assigned tasks, and these strategies will be
investigated through future analysis of the tracking data in the time and
frequency domain. It appears however that there is some consistency within
the waveforms of a given subject. In this case the between-subject
variability presents no serious difficulty and may well prove useful in
assessing individual differences in performance. In a sense, calibrating
an ERP analyzer to the physiological response characteristics of an
individual operator imposes no greater engineering demands than custom
designing a helmet or flight suit to his anthropometric characteristics.

At  this  point,  the  state  of  our  research  is  too  preliminary  to  draw 
firm  conclusions  concerning  the  effects  of  attentional  manipulations  upon 
the  event-related-potentials.  However,  given  the  projected  importance  of 
nondisruptive  measures,  and  the  established  sensitivity  of  such  measures 
to  certain  attentional  variations  in  the  laboratory,  we  are  sufficiently 
encouraged  to  pursue  the  direction  of  experimentation  outlined  above. 
PUPILLOMETRIC MEASUREMENT OF COGNITIVE WORKLOAD

By  Jackson  Beatty 

Department  of  Psychology 
University  of  California  at  Los  Angeles 


SUMMARY 


The momentary workload that is imposed by a cognitive task upon the limited
capacity human information-processing system appears to be accurately reflected in
the momentary level of central nervous system activation. The utility of pupillometric
methods of workload assessment is evaluated and several lines of experimental
evidence relating activation and cognitive function are reviewed.


INTRODUCTION 


Information processing tasks differ in the extent and duration of the demands
that they place upon the limited capacity of the human nervous system to handle
information. For most tasks, processing demands are not constant, but vary from moment
to moment in response to changes in the functional organization of the task. These
demands may be thought to represent the cognitive workload associated with the task, a
time-varying function of the demand for limited resources.

Given the assumption that cognitive capacity is fixed (reference 1), the
momentary demands of any single processing function for capacity may be estimated by
determining the amount of residual capacity that may be allocated to another processing
task that is assigned a secondary priority (reference 2). Secondary-task measurement
of cognitive workload is of major importance in the study of both cognitive capacity
and the resource demands of particular processes, but both technical (reference 2) and
theoretical (reference 3) difficulties preclude the utilization of secondary-task
procedures in [...] situations. For this reason the more convenient method of subjective
estimation of cognitive workload is still commonly employed (reference 4) despite
serious questions as to both the reliability and validity of such rating procedures.

A third approach to the problem of measuring momentary cognitive workload
stems from the observation that momentary workload is directly reflected in the
momentary level of central nervous system (CNS) activation (references 5 and 6). Of the
various indicators of activation, pupillometric measurement techniques (references
7, 8 and 9) appear to be most sensitive and reliable (reference 10).

The present paper examines several lines of evidence suggesting that
pupillometric measures of activation serve as a reliable indicator of cognitive workload in
perception, memory, decision and complex problem solving. An extension of this
experimental method to the study of problems of workload optimization in complex
man/machine systems is then considered.


PERCEPTUAL  PROCESSES 


Perceptual processes appear to proceed quite effortlessly and place rather little
demand upon the limited capacity of the human information-processing system (5).
Thus Wickens (reference 11) was unable to observe a secondary task decrement when
a sensory signal-detection task was imposed as the primary task in an experiment
investigating the distribution of processing capacity. The workload involved in the
detection of weak signals is quite small.

In this context, it is of interest to note that small but reliable pupillary dilations
accompany the detection of both visual and acoustic signals at near-threshold
intensities. Hakerem and Sutton (reference 12) examined the pupillary movements that
accompany the perception of weak visual stimuli and were able to show a dilation for
signals that were detected which was absent for signals that were missed. More
recently Beatty and Wagoner (reference 13) provided a pupillometric analysis of
activation in the detection of weak acoustic signals using a rating-scale response procedure
(see reference 14). Using unmarked observation intervals, no pupillary dilations were
observed in the absence of a signal regardless of the outcome of the observer's
decision. In the presence of a signal, a dilation of the pupil appeared in the interval
between signal delivery and response cue onset. The magnitude of this dilation varied
monotonically with the observer's rated probability that a signal had been presented.

These data raise the interesting possibility that pupillometric methods may
provide a more sensitive measure of cognitive load than do conventional secondary-task
measurement techniques. Thus the small pupillary dilations observed during
perceptual processing may be indexing brain workload levels that are not of sufficient
magnitude to be detected by secondary task interference methods.
DECISION PROCESSES


Even  simple  decision  processes  appear  to  impose  some  workload  on  the  cogni¬ 
tive  system  as  indicated  by  pupillometric  measures  of  activation.  For  example, 
Simpson and Hale (reference 15) measured pupillary diameter in two groups of subjects
who were required to move a lever to one of four positions. In the decision
group, subjects were told at the beginning of each trial that either of two directions
was permissible (e.g., front or left). Seven seconds later a response cue was presented
and the subject initiated one of the two movements. In the no-decision control
group, subjects were instructed exactly as to the desired movement on each trial
(e.g., front). Pupillary dilation in the post-instruction, pre-response period was larger
and more prolonged for those subjects who had to choose between two movements
before responding.

Substantially  larger  pupillary  dilations  are  observed  to  accompany  more  diffi¬ 
cult  decision  processes.  In  an  experiment  reported  by  Kahneman  and  Beatty  (refer¬ 
ence  16),  listeners  were  required  to  determine  whether  a  comparison  tone  was  of 
higher  or  lower  pitch  than  the  standard.  Clear  pupillary  dilation  occurred  in  the  4- 
second  decision  period  between  presentation  of  the  comparison  tone  and  the  response 
cue.  The  amplitude  of  this  dilation  varied  as  a  direct  function  of  decision  difficulty, 
the  difference  in  frequency  between  the  standard  (850  Hz)  and  comparison  tones. 

This  relation  is  shown  in  figure  1,  which  presents  both  the  amplitude  of  dilation  in 
the  decision  period  and  the  percent  decision  errors  as  a  function  of  the  frequency  of 
the  comparison  tone.  These  dilations  were  highly  reliable  and  did  not  habituate  over 
the  experimental  session.  Pupillary  dilations  during  decision  appear  to  vary  as  a 
function of cognitive workload, as inferred from task parameters and performance
data.


MEMORY  PROCESSES 


The  idea  that  human  information-processing  capacity  is  limited  arose  directly 
from  the  study  of  the  limitations  of  human  short-term  or  working  memory  (reference 
17).  Our  capacity  for  unrelated  items  is  on  the  order  of  seven  or  eight,  with  some 
adjustment  being  made  for  the  difficulty  of  the  to-be-remembered  units.  If  pupillary 
movements  reflect  CNS  activation  shifts  as  a  function  of  cognitive  workload,  then 
these  relations  should  be  clearly  revealed  in  the  pupillometric  investigation  of  mem- 
ory  processes. 

Figure 1. Average pupillary dilation during the decision period and percent errors
as a function of the frequency of the comparison tone. The frequency of the
standard was 850 cps. (From Kahneman & Beatty, 1967)

Kahneman and Beatty (reference 18) provided a demonstration that the momentary
load placed upon the cognitive system by a memory task is reflected in pupillary
diameter. In a series of experiments on short-term serial memory using paced recall,
students were required to listen to strings of from one to seven items and, after
a 2 second pause, repeat the string at the rate of one item per second. For strings of
digits, pupillary diameter increased as each item of the input string was heard and
decreased as each item of the output string was spoken. Thus pupillary diameter at
the pause between input and output varied as a monotonic function of the number of
items held in memory. These pupillary functions are shown in figure 2A.

Workload  in  a  memory  task  depends  not  only  upon  the  number  of  items  to  be  re¬ 
membered,  but  also  upon  the  difficulty  of  each  of  the  items  themselves.  Thus,  as 
fewer  unrelated  words  may  be  reliably  recalled  than  unrelated  digits,  the  load  im¬ 
posed  by  each  word  upon  the  cognitive  system  is  presumed  to  be  greater.  Figure  2B 
presents  the  results  of  a  serial  memory  experiment  involving  strings  of  four  digits 
or  four  words.  For  the  simple  recall  conditions,  it  is  apparent  that  the  slope  of  the 
pupillary  function  is  greater  for  the  more  difficult  word  strings  than  for  the  easier 
digit  strings.  That  these  pupillary  response  functions  are  sensitive  to  processing  pa¬ 
rameters  is  evident  from  the  large  dilations  observed  under  the  condition  labelled 
"transformation,"  in  which  the  subject  was  required  to  respond  to  the  string  of  4  di¬ 
gits  with  another  string  obtained  by  adding  1  to  each  digit  of  the  input  string.  This 
transformation  task  is  the  most  difficult  of  all  memory  tasks  studied,  as  indicated  by 
the  error  data,  and  it  consistently  was  accompanied  by  larger  pupillary  movements 
indicating  CNS  activation. 

Behavioral data supporting the contention that the demands upon limited information-processing
capacity increase during the rising phase of the pupillary response
function as items are entered into working memory, and decrease during the
falling phase of that function as items are successively recalled from memory, are
provided by an experiment in which residual capacity was measured using secondary-task
measurement. Kahneman, Beatty, and Pollack (reference 19) reported that the
pattern of interference with a secondary perceptual-detection task exactly paralleled
the pupillary-activation curve obtained for the serial memory transformation task
alone. For serial memory tasks, changes in cognitive workload appear to be reflected
in the momentary level of CNS activation, as indexed by pupillometric measurement.


COMPLEX  PROBLEM  SOLVING 


Figure 2. Upper graph: Average pupillary diameter during presentation and recall of
strings of 3 to 7 digits, superimposed about the two second pause between
presentation and recall. Slashes indicate the beginning and the end of the
memory task. Lower graph: Pupillary diameter during presentation and
recall of four digits, words and a digit transformation task. (From Kahneman
& Beatty, 1966)

Pupillary dilations accompanying complex problem solving appear to be related
directly to the difficulty of such processing, although behavioral assessments of
workload have not yet appeared for these types of cognitive tasks. For example, Hess
and Polt (reference 8) examined pupillary movements as multiplication problems were solved
mentally.  Pupillary  diameter  increased  during  the  period  preceding  solution,  and  the 
magnitude  of  dilation  was  related  to  presumed  problem  difficulty.  Payne,  Parry,  and 
Harasymiw  (reference  20)  also  report  a  monotonic  relation  between  mean  pupillary 
diameter  and  problem  difficulty,  but  note  that  this  relationship  is  markedly  nonlinear 
with  respect  to  difficulty  scales  based  upon  percent  correct  solution,  time  to  solution 
or  subjective  rating  of  difficulty.  Pupillary  diameter  in  mental  multiplication  appears 
to  peak  rapidly  as  a  function  of  difficulty,  with  more  difficult  problems  requiring 
more  time  until  solution  is  reached.  This  suggests  that  cognitive  capacity  is  quite  ful¬ 
ly  taxed  in  complex  mental  arithmetic  problems  so  that  the  workload  per  unit  time  re¬ 
mains  relatively  constant  as  problem  difficulty  is  increased  over  moderate  levels,  but 
that  the  total  time  to  solution  is  increased. 

Other  types  of  complex  problem  solving  tasks  show  similar  relationships  be¬ 
tween  pupillary  dilation  and  problem  difficulty.  For  example,  Bradshaw  (reference 
21)  has  reported  that  larger  pupillary  dilations  accompany  the  solving  of  more  difficult 
anagrams, and that these dilations are maintained until solution is reached.


IMPLICATIONS  FOR  WORKLOAD  EVALUATION  IN  MAN/MACHINE  SYSTEMS 


Traditional  interference  and  subjective-rating  methods  of  workload  evaluation 
have  been  employed  in  the  design  of  complex  man/machine  interfaces,  but  neither  is 
without  its  own  particular  limitations.  Pupillometric  methods  of  workload  estimation 
provide  a  third  alternative  that  in  certain  situations  might  be  preferable  to  either  of 
the  more  traditional  measurements. 

One problem for which pupillometric assessment procedures appear to be well-suited
is that of display evaluation. Pupillometric methods permit reliable measurement
of the small cognitive workloads associated with the processing of sensory information
that may not be detectable by interference methods. One project underway in
our laboratories examines the effects of display readability on the pupillary dilations
accompanying information acquisition. A second experiment is concerned with pupillometrically
measuring cognitive workload involved in processing computer-generated
speech at various levels of intelligibility.

The  most  intriguing  possibility  is  that  the  measurement  of  central  nervous  sys¬ 
tem  activation  associated  with  cognitive  function  might  provide  a  common  metric  for 
the  comparison  of  workload  in  tasks  that  differ  substantially  in  their  functional  char¬ 
acteristics.  Underlying  this  possibility  is  the  idea  that  CNS  activation  is  the  limited 
general  resource  that  is  allocated  among  cognitive  processes  demanding  capacity.  If 
this is the case, then it may be possible to directly compare perceptual, memory,
symbol manipulation and response processes in terms of activation requirements. At
present, however, we may only conclude that the pupillometric measures of activation
are useful in measuring cognitive load for a range of cognitive processes. No evidence
concerning the comparability of measurements made across diverse processes
has yet appeared.


LEFT-BRAIN/RIGHT-BRAIN AND SYMBOLIC/ANALOGIC
HUMAN OPERATOR OUTPUT COMPATIBILITY
By  William  L.  Verplank 
Massachusetts  Institute  of  Technology 


SUMMARY 


This paper introduces the terms symbolic (e.g., [...]) and analogic (e.g., [...]) to describe
devices for interaction between human operator (h.o.) and computer. With increasing
automation, there is a tendency for man-machine communication to become
increasingly symbolic. There is a danger that the interface will constrain
the thinking of the human operator. To [...]
split-brain patients.


COMPUTER-AIDED  MANIPULATION 


The structure and capability of any h.o.-computer system is strongly
constrained by the form of the interface. I considered that
interface as the first step in designing a system for supervisory control
of remote manipulation. The supervisory control concept was first suggested
by Ferrell and Sheridan [2,3], as a means for overcoming
the difficulty of manipulation with transmission delay (see
figure 1).


The  Subroutine  Approach 

The particular form of supervisory control considered was that of providing
the manipulator with the capability to execute subroutines
or subtasks. For example, the "arm" program at Stanford
Research Institute [4] has a subroutine, "GROPE", that lowers the jaw around a
block using touch sensors (figure 2). This subroutine would
be part of a larger task, such as [...] blocks. Some parts of the task
[...] computer control. These can be organized [...]
commands [...] through the teletype. Programming simple motions
[...] time-consuming and laborious.


Command  Hardware

If the h.o. is to communicate arm positions directly, the most appropriate
form of command might be "master-slave"-like control with a brace or
replica. On the other hand, there may be occasion for the h.o. to indicate
direction, and the computer to modify the path, to avoid obstacles or simply
to stop when it touches something. This seems the most elementary form of
supervisory control, and would be quite useful with degraded or delayed
feedback from a remote manipulator. The brace seems less appropriate at this
level because, for one thing, there can no longer always be a one-to-one
correspondence between the master and slave, which requires re-setting the
correspondence every time control is transferred from computer to h.o. At
this level, some form of joystick providing rate-control seems more appropriate.
G. Starr [6] is doing a detailed study of rate control; supervisory
control is a potential application.

At a higher level, any frequently used subroutine could be called by
the h.o. pushing a button or flipping a switch (e.g., "STOP IF YOU TOUCH",
or "GROPE"). Strings of subroutines, or new programs, would be more appropriately
input through a special purpose or standard keyboard. Voice seems an
appropriate command mode also, especially because it can be combined with
others; for example, "SLOW" might lower the gains on a rate control; "EASY"
might adjust the servo to produce compliance; "CAREFUL" might increase the
sensitivity of touch or proximity sensors.


Symbolic  and  Analogic 

One  way  to  describe  this  range  of  command  modes  is  as  running  from  ana¬ 
logic  (brace,  joystick)  to  symbolic  (button,  keyboard,  voice)  along  a  conti¬ 
nuum  of  abstraction.  It  seems  important  to  provide  a  range  of  command  modes 
in  order  that  the  system  be  accessible  at  different  levels,  possibly  at  two 
levels simultaneously. For example: "move in this direction" (ANALOGIC),
"but stop if you touch something" (SYMBOLIC).

This division of control or command hardware into symbolic and analogic
suggests the parallel between computer-aided manipulation (C.A.M.) and computer-aided
design (C.A.D.) (figure 4). In both cases it seems most appropriate
to use both symbolic and analogic controls, perhaps simultaneously.
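
The example of moving in a commanded direction while obeying "stop if you touch something" can be sketched as follows. This is a minimal illustration only, assuming a rate-controlled arm and a binary touch sensor; the names `step_arm` and `run`, the simulated sensor, and all numerical values are hypothetical and are not taken from the manipulator systems cited in this paper.

```python
from typing import Callable, Sequence, Tuple

Vector = Tuple[float, ...]


def step_arm(position: Vector, direction: Vector, rate: float, dt: float,
             guards: Sequence[Callable[[], bool]]) -> Tuple[Vector, bool]:
    """Advance the arm one time step under combined control.

    direction and rate form the analogic command (e.g. from a joystick).
    guards are symbolic commands: each is a predicate that halts motion
    when it returns True (e.g. "stop if you touch something").
    Returns the new position and a flag saying whether a guard stopped motion.
    """
    if any(guard() for guard in guards):
        return position, True
    moved = tuple(p + rate * d * dt for p, d in zip(position, direction))
    return moved, False


def run(touch_after_steps: int, max_steps: int = 100) -> Tuple[Vector, int]:
    """Simulate a move along +x with a touch sensor that fires after N steps."""
    steps_taken = 0

    def touched() -> bool:
        # Stand-in for a real touch sensor (hypothetical).
        return steps_taken >= touch_after_steps

    position: Vector = (0.0, 0.0, 0.0)
    for _ in range(max_steps):
        position, stopped = step_arm(position, (1.0, 0.0, 0.0),
                                     rate=0.5, dt=0.1, guards=[touched])
        if stopped:
            break
        steps_taken += 1
    return position, steps_taken
```

The operator supplies the direction continuously, while the guard is named once and then enforced by the computer, which is one way of reading access "at two levels simultaneously".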


VISUAL  THINKING 


My teaching experience in the Design Division at Stanford University has
made me more aware of the importance of this distinction between symbolic and
analogic modes and their relationship to thinking. Robert McKim [7] has developed
a drawing and design course that focuses on rapid visualization (freehand
sketching) and the process of design. The philosophy of the course (and
what makes it so popular) is that productive thinking of all sorts requires
fluency in not only analytical, verbal modes (on which most education is
focused) but also in the sensory, visual modes (in which our education is
severely lacking).

What  is  visual  thinking?  To  give  you  an  experience  in  what  visual 
thinking is, here is a problem to solve from McKim's book:

"Painted Cube: Shut your eyes. Think of a wooden cube
such as a child's block. It is painted. Now imagine
that you take two parallel and vertical cuts through the
cube, dividing it into equal thirds. Now take two additional
vertical cuts, at 90° to the first ones, dividing
the cube into equal ninths. Finally, take two parallel
and horizontal cuts through the cube, dividing it into
27 cubes. Now, how many of these small cubes are painted
on three sides? On two sides? On one side? How many
cubes are unpainted?"

Most people report quite clear use of some form of visual imagery in
counting the cubes. A symbolic mathematical approach would be cumbersome.
The unique features of the Visual Thinking course
include "experiential exercises for the mind's eye". Mental manipulation of
mind's-eye imagery improves with practice.
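
For contrast with the visual route, the symbolic route to the Painted Cube problem can be sketched as a brute-force count. This is an illustration added for comparison, not part of McKim's exercise; the function name `painted_face_counts` is hypothetical.

```python
from collections import Counter
from itertools import product


def painted_face_counts(n: int = 3) -> Counter:
    """Count small cubes by number of painted faces after cutting a painted
    cube into n x n x n pieces (valid for n >= 2)."""
    counts: Counter = Counter()
    for cell in product(range(n), repeat=3):
        # Along each axis, a small cube shows a painted face only if it
        # lies in the first or last layer of that axis.
        painted = sum(1 for c in cell if c in (0, n - 1))
        counts[painted] += 1
    return counts


counts = painted_face_counts(3)
print(counts[3], counts[2], counts[1], counts[0])  # prints: 8 12 6 1
```

The count confirms what imagery suggests: corners carry three painted faces, edge pieces two, face centers one, and the single interior cube none.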

Two  Modes  of  Thinking 

There has long been a distinction made between ways of thinking, and
great lists can be made [...] that seem to correlate. My own
list is given in [...]. [...] importance to problem solving, what has been
referred to as creative thinking is the ability to move freely from one
[...]. Insight often comes through visual or sensory
imagery. There are numerous accounts of this process; the most recent
is by T.H. Krueger [8].

Split-Brain  Studies 

Clinical evidence on split-brain human patients (with hemispheres
separated to relieve epilepsy) has revived interest in the differing roles
of the [...] [9, 10, 11]. The left hemisphere
is usually dominant and controls speech. The
right [...] more with sensory and spatial thinking. Whether
[...] not the dichotomy [...]
knowing and thinking.


ANALOGIC  PART  PROGRAMMING 


David Gossard, at M.I.T., has recently applied this notion of the
symbolic and analogic modes to preparing programs for numerically controlled
machine tools [12]. Conventionally, a highly trained programmer prepares a
program in a symbolic language (e.g., APT) directly from the drawings of the
part (figure 6a). This program is compiled to produce an ideal tool-path
(cutter location or C/L data) which must be verified. A post-processor computes
the commands necessary to make the particular machine tool follow that
path. Gossard's system (figure 6b) uses computer graphics with a picture of
the work-piece, tool and clamps. The programmer uses knobs (analogic) and
keyboard (symbolic) to move the tool through the appropriate paths. The
computer can record and play back, straighten out, or interpret symbolically
(e.g., circle) paths put in analogically by the operator. The computer then
outputs the C/L data directly.


Gossard's experience with both part programmers and machinists untrained
in part-programming points up the advantages of using computer inputs compatible
with thinking modes. In three hours the machinists were able to produce
part programs equivalent to what they would have needed three months' training
to write. Even the trained programmers preferred the analogic mode. One
of the unexpected advantages commented on by the users was the ability to instantly
see and to avoid problems of interference between tool and clamps.
The programming of complicated spatial motions and relationships is a process
that clearly benefits from the ability to use both analogic and symbolic command
modes.


CONCLUSION 


With  increasing  automation,  there  is  a  tendency  for  man-machine  commu¬ 
nication  to  become  increasingly  symbolic.  This  is  natural  and  efficient  since 
complete processes can be commanded by simple names or labels. There is, however,
the danger that the interface will constrain the thinking of the human
operator  to  symbolic  modes.  The  human  operator  also  has  special  abilities  in 
visual-spatial  thinking.  To  allow  both  modes  of  thinking,  the  man-machine 
interface  should  provide  for  simultaneous  access  over  a  range  of  control  from 
symbolic  to  analogic. 


Figure 1. The supervisory control concept for remote manipulation was
introduced by Ferrell and Sheridan [2,3].


Figure 2. Subroutine "GROPE" uses touch sensors on finger and thumb to
center and lower the jaw around an object.

Figure 3. A task hierarchy for picking up blocks shows the interaction of
h.o. and computer on a variety of levels. The command hardware appropriate
at different levels is shown on the right.


Figure 4b. Proposed Command Hardware for Computer-aided Manipulation (C.A.M.),
showing symbolic control and analogic control.

Figure 5. The dichotomies of thinking modes are associated with each other and
the hemispheres of the brain.


Figure 6a. Conventional part programming for numerically controlled (N/C)
machine tools requires a part programmer trained in a highly symbolic
computer language.

Figure 6b. Analogic Part Programming [12] uses computer displays and
analogic (knobs) and symbolic (keyboard) modes for more efficient
programming (less training, better programs). There appears to be
better compatibility between thinking modes and human operator output
(right-brain vs. left-brain and spatial vs. linguistic and analogic
vs. symbolic).


A REGRESSION APPROACH TO GENERATE AIRCRAFT PREDICTOR INFORMATION
By Paul D. Gallaher, Roger A. Hunt, and Robert C. Williges
University of Illinois at Urbana-Champaign


SUMMARY 


A  predictor  display  shows  the  human  operator  future  consequences  of  his 
immediate  control  inputs.  A  contact  analog  aircraft  display  is  described  in 
which an airplane-like predictor symbol depicts future airplane position and
orientation.  The  standard  method  for  obtaining  the  predictor  information  is 
to  use  a  complete,  fast-time  model  of  the  controlled  vehicle.  An  alternative 
approach  is  presented  in  this  paper  in  which  least-squares,  first-order, 
linear  approximations  for  each  of  the  six  degrees  of  freedom  of  aircraft 
motion were calculated. Thirteen variables representing changes in positions
and  rate  of  change  of  positions  were  selected  as  parameters  for  the  prediction 
equations.  Separate  sets  of  equations  were  determined  for  7,  14,  and  21 
seconds  prediction  times  and  continuous,  1,  and  3  seconds  control  neutraliza¬ 
tion  assumption  times.  The  advantages  and  disadvantages  of  this  regression 
approach  are  discussed. 


INTRODUCTION 


Predictor  displays  provide  the  operator  of  manually  controlled  systems 
with  information  about  the  future  state  of  the  variable  being  controlled. 

Often  this  information  can  be  generated  by  an  analog  of  the  system  to  be 
controlled,  operating  repetitively  in  an  accelerated  time  scale.  Ideally,  to 
generate  a  predictor  model  using  such  a  fast-time  model,  the  model  should  be 
a  duplicate  of  the  original  plant.  For  example,  to  put  a  predictor  display 
in an aircraft trainer which uses an analog computer for all flight equations
and dynamics, a second analog computer just like the first with speeded-up
time constants could be used. Such complexity in using an accurate fast-time
model imposes a penalty of cost, computer weight, and power requirements. In
fact, Kelley (1) pointed out that it may not be necessary to have the complete
accuracy of a fast-time model.
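
The fast-time idea can be illustrated with a short sketch: a copy of the plant model is integrated ahead over the prediction span, with the current control input held fixed, much faster than real time. The second-order plant, its coefficients, and the function names below are assumptions chosen for illustration; they are not the GAT-2 flight equations.

```python
def plant_derivatives(state, control, damping=0.5, stiffness=1.0):
    """Hypothetical second-order plant: x'' = control - damping*x' - stiffness*x."""
    x, v = state
    return v, control - damping * v - stiffness * x


def fast_time_predict(state, control, span, dt=0.01):
    """Run the plant model ahead by `span` seconds of model time,
    holding the control input constant (simple Euler integration)."""
    x, v = state
    steps = int(round(span / dt))
    for _ in range(steps):
        dx, dv = plant_derivatives((x, v), control)
        x, v = x + dx * dt, v + dv * dt
    return x, v


# Predicted states at three look-ahead times for a step control input.
predictions = [fast_time_predict((0.0, 0.0), control=1.0, span=T)
               for T in (7.0, 14.0, 21.0)]
```

Each display update repeats this whole integration from the current state, which is the source of the cost and computing penalty noted above.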


Bernotat (2) used a Taylor series expansion rather than the fast-time
model approach, and found that even inaccurate predictions gave improved
performance  over  no  prediction  in  the  control  of  a  third-order  undamped  system 
following  a  step  input.  Kelley  (3)  found  the  same  effect,  but  he  also  found 
that  the  useful  prediction  span  decreased  with  model  accuracy  while  learning 
times  for  effective  manual  control  were  increased.  A  comprehensive  study  of 
simplified  models  for  an  automatic  predictive  control  system  for  aircraft 
landing  in  two  dimensional  sideways  looking  displays  was  conducted  by 
Chestnut, Sollecito, and Troutman (4). They pointed out that the model may
be of either the analog or digital form, but they felt the digital approach
offers  more  accuracy  and  flexibility.  They  also  indicated  that  the  time 
constants  and  gains  of  the  model  can  be  in  error  by  two  to  one  without  exces¬ 
sive  loss  in  performance. 
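
A Taylor series predictor of the kind mentioned above can be sketched in a few lines. The order of expansion used in the cited work is not stated here; truncation after the second-order term, and the function name `taylor_predict`, are assumptions made for illustration.

```python
def taylor_predict(value, rate, acceleration, span):
    """Second-order Taylor extrapolation of a controlled variable:
    x(t + T) is approximated by x + x'*T + x''*T**2 / 2."""
    return value + rate * span + 0.5 * acceleration * span ** 2


# Example: current value 10, rate +2 per second, acceleration -1 per second squared.
predicted = taylor_predict(10.0, 2.0, -1.0, span=4.0)
```

Unlike a fast-time model, this extrapolation needs only the current value and its derivatives, at the price of growing error as the prediction span lengthens.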

The  main  effect  of  an  inaccurate  model  is  closely  related  to  the  predic¬ 
tor  span.  The  magnitude  of  the  errors  in  an  inaccurate  predictor  can  be 
determined  analytically  or  experimentally  if  the  plant  can  be  observed 
directly  or  simulated  accurately  for  comparison  with  a  less  accurate  fast- 
time  model.  Errors  farther  into  the  future  are  usually  compensated  for  by 
the  fact  that  accuracy  requirements  on  short  predictions  usually  are  greater 
than  for  long  predictions.  Predictor  displays  can  also  overcome  the  problem 
of  accuracy  when  they  are  continuously  updated.  If  updating  is  inaccurate  or 
infrequent,  the  fast-time  model  must  be  that  much  more  accurate. 

This  paper  presents  a  least-squares,  regression  approach  for  determining 
first-order, linear approximations of accurate fast-time models used in
predictor displays. Such a procedure would eliminate the need for an operational
fast-time model while still providing a great deal of predictive
accuracy.  The  accuracy  of  this  regression  approach  for  generating  these 
predictor  symbols  is  evaluated  both  at  various  prediction  times  and  at  various 
control  input  durations. 


METHOD 


Task 


For the purpose of demonstrating the use of a regression approach to
generate predictor information, an application incorporating predictor
information in an aircraft system during an approach to landing was used.
Because of the complexity and sluggishness of the aircraft system in the
landing phase, manual performance depends heavily on the anticipatory
abilities of the pilot. Under such circumstances, predictive displays might
be very useful. Smith, Pence, Queen, and Wulfeck (5) demonstrated that the
predictor display did improve performance in an approach to landing on an
aircraft carrier. It even facilitated learning to such an extent that mean
performance on transfer trials using a predictor was considerably higher than
that of a control condition without the predictor.


The specific approach to landing task in this study was generated for a
Singer-Link General Aviation Trainer (GAT-2) which simulates general, light,
twin-engine aircraft. The predictor symbology was incorporated into a
versatile computer-generated dynamic flight display developed by Artwick (6)
and was part of an integrated, vertical situation display stylized in
Figure 1. In addition to the situational information of runway outline,
centerline, touchdown zone, and grid-line ground texture cues, three glideslope
indicators in the form of telephone-pole-shaped symbols and three
discrete, airplane-like predictor symbols are shown on the display. The
predictor symbols represent the position of the aircraft at three particular
future points in time (7, 14, and 21 seconds as used in this study) given a
specified control input by the pilot.


Figure  1.  Stylized  representation  of  an  integrated  vertical  situation 
display  showing  three  aircraft-like  predictor  symbols. 


Regression  Procedure 

To generate the predictor symbols shown in Figure 1, one must specify
the changes in the six degrees of freedom of aircraft motion as listed in
Table 1. Each of these six degrees of freedom is determined by the specific
flight dynamics of the aircraft. These dynamics are specified in terms of
complex, higher-order differential equations which represent position,
change in position, and rate of change of position as shown in Table 2.
(These values are all accessible as millivolts in the GAT-2 analog computer.)

Rather  than  use  the  complete  set  of  complex  flight  equations,  a  first- 
order  linear  approximation  may  suffice  particularly  in  the  limited  range  of 
variables  encountered  in  a  final  approach  to  landing  situation.  A  standard, 
least-squares,  multiple  linear  regression  analysis  (Tatsuoka,  7)  can  be 
used  to  estimate  a  raw-score,  linear  approximation  of  the  general  form, 

Y = \beta_0 + \beta_1 X_1 + \ldots + \beta_m X_m    (1)

TABLE 1

Changes in Six Degrees of Freedom of Aircraft Motion Required to Specify
Aircraft Predictor Symbology

Degrees of Freedom
Change in Bank (\Delta\theta_B)
Change in Yaw (\Delta\theta_Y)
Change in Pitch (\Delta\theta_P)
Change in Lateral Position (\Delta x)
Change in Vertical Position (\Delta [...])
Change in Longitudinal Position (\Delta [...])


TABLE 2

Initial Variables Used to Predict Changes in Six Degrees of Freedom of
Aircraft Motion

Predictor Variables
Aileron Position (\alpha)
Rudder Position (\rho)
Elevator Position (\epsilon)
Throttle Position (\tau)
Bank Angle (\theta_B)
Yaw Angle (\theta_Y)
Pitch Angle (\theta_P)
Cosine Bank (\cos\theta_B)
Rate of Roll (\dot{\theta}_B)
Rate of Pitch (\dot{\theta}_P)
Rate of Yaw (\dot{\theta}_Y)
Rate of Climb (R/C)
Velocity (v)


where Y represents the dependent variable, X_1 through X_m the independent
variables, and \beta_0 through \beta_m the partial regression coefficients. Specifically,
the general form of Equation 1 for the predictor symbology case is,

\Delta(\text{degree of freedom}) = \beta_0 + \beta_1 \alpha + \beta_2 \rho + \beta_3 \epsilon + \beta_4 \theta_B
    + \beta_5 \theta_Y + \beta_6 \theta_P + \beta_7 \cos\theta_B + \beta_8 \dot{\theta}_B
    + \beta_9 \dot{\theta}_P + \beta_{10} \dot{\theta}_Y + \beta_{11} R/C    (2)

where Y is replaced by the particular change in degree of freedom of interest,
the X's are replaced by selected variables in Table 2, and the \beta's represent
the raw-score, partial regression weights which are empirically determined.
All the independent variables except velocity and throttle can take on
both positive and negative values. Velocity and throttle are always zero or
some positive value, [...] contribution to the predictor equation would
[...]. Furthermore, velocity and throttle changes should amplify
[...] produced by control surface position and airplane position changes.
[...] position as well as the [...] throttle values of the GAT-2. The remaining
variables in Equation 2 already contain velocity and throttle information,
because they are rates of change in position.

Data  Collection  Procedure 

Of the thirteen independent variables shown in Table 2, only the changes
in the three control surfaces (rudder, aileron, and elevator) and in the
throttle can be directly affected by the pilot. [...] necessary data for
generating the regression equations. A one-third replica [...] of these four
variables was observed twice [...] 54 data collection flights. The remaining
nine variables were considered to be approximately random and were not
manipulated through experimenter control.

During each of these 54 data collection cycles the GAT-2 was flown in an
approach to landing configuration. The landing gear was down and the proper
[...]. [...] GAT-2 was flown by the pilot to the proper landing configuration
the Raytheon [...] the independent variables shown in Table 2, and the changes
in the six degrees of freedom of motion [...] at 7, 14, and 21 seconds. These
latter values provided the three prediction times represented by the
successive discrete predictor symbols shown in Figure 1.

To simulate the four designated control surface positions over different
flights, the [...] surface positions [...] changes in the [...]. New signals
were sent through the digital-to-analog converter and into the GAT-2 computer
to maintain precisely a given set of control movements.

As shown in Table 3, the factorial design of this study also allowed for
the calculation of six prediction equations for each of three control
assumption times and three prediction times. The length of time these control
surface changes were maintained prior to neutralization determined the
control assumption times. When the control changes were maintained
continuously over the 21-second prediction span, this produced the continuous
[...]. If the control changes were not [maintained] throughout the data
collection phase, an off-line predictor model is used. Two off-line models
using control assumption times of 1 and 3 seconds were also investigated in
this study. A different set of 54 approaches to landing was required for each
control assumption time. Consequently, a total of 162 approaches were measured.

TABLE 3

[Title illegible in source]

Control Assumptions    Prediction Times (Seconds)
(Seconds)              7                14               21

Continuous (21)        6 Regression     6 Regression     6 Regression
                       Equations        Equations        Equations

1                      6 Regression     6 Regression     6 Regression
                       Equations        Equations        Equations

3                      6 Regression     6 Regression     6 Regression
                       Equations        Equations        Equations
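The counts implied by this design can be checked with a short sketch. The list names below are illustrative labels, not identifiers from the study; the figure of 54 flights per control assumption is the one stated in the text.

```python
from itertools import product

DEGREES_OF_FREEDOM = ["bank", "yaw", "pitch", "X", "Y", "Z"]
PREDICTION_TIMES_S = [7, 14, 21]
CONTROL_ASSUMPTIONS = ["continuous (21 s)", "1 s", "3 s"]
FLIGHTS_PER_CONTROL_ASSUMPTION = 54  # stated in the text

# One regression equation per (control assumption, prediction time, degree of freedom).
equations = list(product(CONTROL_ASSUMPTIONS, PREDICTION_TIMES_S, DEGREES_OF_FREEDOM))
equations_per_cell = len(DEGREES_OF_FREEDOM)
total_equations = len(equations)
total_approaches = FLIGHTS_PER_CONTROL_ASSUMPTION * len(CONTROL_ASSUMPTIONS)
print(total_equations, total_approaches)
```

Each cell of the table holds six equations, one per degree of freedom, so the nine cells give 54 equations, and three sets of 54 flights give the 162 approaches.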

RESULTS 


A multiple linear regression analysis was conducted on all 11 independent
variables [...] six [...] equation for a particular predictor symbol. Because
each prediction equation required a separate regression analysis, a total of
54 regression equations were solved.

For example, Table 4 shows the general form of the six prediction
equations needed to represent the airplane predictor symbol at seven seconds
in the future for a three-second control assumption time. Although this
regression analysis was conducted on all 11 of the independent variables shown
in Equation 2, only the statistically significant (p < .05) predictors are
shown in Table 4. Similar sets of prediction equations were derived for the
other treatment conditions summarized in Table 3. In each case, however, the
specific set of statistically reliable partial-regression weights varied
somewhat.


TABLE  4 

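Prediction Equations with Significant (p < .05) Independent Variables Used to
Determine Changes in the Six Degrees of Freedom of Aircraft Motion for Seven-
Second Prediction Span and Three-Second Neutralization Assumption.

$$\Delta\theta_B = \beta_0 + \beta_1 \alpha + \beta_2 \rho + \beta_3 \theta_B + \beta_4 \dot{\theta}_B + \beta_5 \dot{\theta}_Y + \beta_6 \dot{\theta}_P$$

$$\Delta\theta_Y = \beta_0 + \beta_1 \alpha + \beta_2 \theta_B + \beta_3 \dot{\theta}_B + \beta_4 \dot{\theta}_Y + \beta_5 \dot{\theta}_P$$

$$\Delta\theta_P = \beta_0 + \beta_1 \varepsilon + \beta_2 \theta_B + \beta_3 \theta_Y + \beta_4 \theta_P + \beta_5 \dot{\theta}_P + \beta_6\, R/C$$

$$\Delta X = \beta_0 + \beta_1 \alpha + \beta_2 \rho + \beta_3 \theta_B + \beta_4 \theta_Y + \beta_5 \dot{\theta}_B + \beta_6 \dot{\theta}_Y + \beta_7 \dot{\theta}_P$$

$$\Delta Y = \beta_0 + \beta_1 \alpha + \beta_2 \varepsilon + \beta_3 \cos\theta_B + \beta_4 \dot{\theta}_B + \beta_5 \dot{\theta}_P + \beta_6\, R/C$$

$$\Delta Z = \beta_0 + \beta_1 \alpha + \beta_2 \varepsilon + \beta_3 \cos\theta_B + \beta_4 \dot{\theta}_B + \beta_5 \dot{\theta}_P + \beta_6\, R/C$$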


One convenient way of assessing the goodness of fit of each of these
regression equations is to calculate the multiple correlation coefficient.
The square of this value represents the proportion of variance accounted for
by the regression equation. Table 5 summarizes the multiple correlation
coefficients for each of the 54 prediction equations of this study. (For
example, the multiple correlation coefficients for the six prediction
equations presented in Table 4 are .96, .98, .84, .94, .79, and .85,
respectively.) Note that the change in altitude (Y) is the degree of freedom
of aircraft motion which resulted in the lowest multiple correlation
coefficients. Generally, the one-second control assumption time and the
seven-second prediction time also produced regression equations with lower
predictive accuracy.
TABLE 5

Multiple Correlation Coefficients for Each Predictor Equation

Prediction
Times (s)     Bank    Yaw    Pitch    X      Y      Z

Continuous (21 Second) Control Assumption
   7          .98     .95    .97     .87    .78    .77
  14          .98     .98    .98     .95    .85    .94
  21          .99     .98    .97     .97    .92    .96

1 Second Control Assumption
   7          .96     .96    .87     .88    .66    .66
  14          .97     .95    .94     .92    .71    .61
  21          .97     .91    .94     .93    .75    .64

3 Second Control Assumption
   7          .96     .98    .84     .94    .79    .85
  14          .98     .98    .91     .98    .83    .92
  21          .98     .98    .95     .98    .85    .92
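Since the square of a multiple correlation coefficient gives the share of variance accounted for, the six coefficients quoted in the text for the seven-second prediction, three-second control assumption case can be converted directly. The sketch below is only a worked calculation; the variable names are illustrative.

```python
# Multiple correlation coefficients quoted in the text for the Table 4 equations.
DOF = ["bank", "yaw", "pitch", "X", "Y", "Z"]
R = [0.96, 0.98, 0.84, 0.94, 0.79, 0.85]

# Squaring R gives the proportion of variance accounted for by each equation.
variance_explained = {name: round(r * r, 4) for name, r in zip(DOF, R)}
weakest = min(variance_explained, key=variance_explained.get)
```

For this condition the altitude equation (Y) accounts for the smallest share of variance, about 62 percent, while the yaw equation accounts for about 96 percent.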

DISCUSSION 


The overall consistently high multiple correlation coefficients obtained in
this study indicate that the regression approach yields very accurate
prediction equations and is a viable alternative to using the complete,
fast-time model.

The lower multiple correlation coefficients for the one-second control
assumption probably reflect the fact that a one-second control input is
simply too brief to account for any significant movement of the GAT-2 over
the prediction interval. Likewise, the lower predictive power of the 7-second
prediction times as compared to 14 and 21 seconds merely shows that the GAT-2
dynamics are such that the simulator has not completed a response to the
control force inputs. The longer prediction times represent a more complete
simulator response.

A simplification of this approach for application to actual aircraft
would be to remove the variables representing rates of change of motion
which are not normally available. Undoubtedly, this simplification would
reduce the predictive accuracy of the regression equations, because rate
parameters provided significant weightings in the prediction equations. From
a behavioral point of view, however, these less precise equations may not
affect the pilot's performance in flying the aircraft. Additional research is
needed to determine the role of predictor symbol accuracy in determining
operator control inputs before the allowable degree of predictor
simplification can be specified.

This approach to generating predictor symbology offers the advantages of
ease of implementation, low cost, and conformity to a digitally-generated
display. In fact, this method may be better than an accurate, fast-time model
in the sense that time lags no longer grow with prediction span, as they do
when increased computations are required further into the future.
Furthermore, the prediction span need not be compromised by repetition rate,
updating frequency, or computing power available because any discrete
prediction is as easy to make as any other.
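The constant cost per prediction can be seen in a small sketch: each predicted change is one weighted sum, whatever the prediction time. The weights, the choice of three predictors, and the state values below are hypothetical and are not taken from the study.

```python
def predict_change(weights, state):
    """Evaluate b0 + sum(b_i * x_i) for one degree of freedom at one prediction time."""
    intercept, *slopes = weights
    if len(slopes) != len(state):
        raise ValueError("one weight is needed per predictor variable")
    return intercept + sum(b * x for b, x in zip(slopes, state))


# Hypothetical weights for change in bank at each prediction time (seconds).
# Predictors, in order: aileron position, bank angle, rate of roll.
bank_weights = {
    7: [0.0, 1.5, 0.10, 4.0],
    14: [0.0, 3.0, 0.20, 6.0],
    21: [0.0, 4.5, 0.25, 7.0],
}
state = [2.0, 10.0, 0.5]  # hypothetical current readings

predicted = {t: predict_change(w, state) for t, w in bank_weights.items()}
```

The 21-second prediction uses exactly as many multiplications and additions as the 7-second prediction; only the stored weights differ.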

It should be remembered that the specific prediction equations of this
study pertain only to the control dynamics of the GAT-2 at the three
prediction times and control assumption times varied. In other words, the
regression equations are always specific to the device from which the data
are collected. The approach and procedure for generating these regression
equations, however, are general and can be applied to generating predictor
symbology for any specific device. There are probably situations in which a
multiple linear regression may not provide an adequate representation of the
true underlying system dynamics. In such instances a regression approach is
still appropriate, because it can be easily extended to higher-order,
polynomial regression representations of these complex functions.
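One way to picture the polynomial extension is that the predictor list is expanded with powers of each variable, after which the same linear least-squares machinery applies, because the model remains linear in its weights. The sketch below shows only the expansion step; the function name is hypothetical, and omitting cross-product terms is a simplifying assumption.

```python
def polynomial_terms(predictors, degree=2):
    """Expand [x1, ..., xm] into [x1, ..., xm, x1**2, ..., xm**2, ...] up to `degree`."""
    if degree < 1:
        raise ValueError("degree must be at least 1")
    return [x ** power for power in range(1, degree + 1) for x in predictors]


# Two hypothetical predictor readings expanded to third order.
expanded = polynomial_terms([2.0, -3.0], degree=3)
```

The expanded row can be passed to any linear regression fit in place of the original predictors.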
INTRODUCTION TO A COORDINATED COCKPIT DISPLAY CONCEPT

By D. L. Baty

Ames  Research  Center,  NASA 
Moffett  Field,  California  94035 


ABSTRACT 


The aircraft instrument panel has evolved as an ever growing collection
of subsystem indicators. Although human factors design has reduced the
difficulty in interpreting the displayed parameters, it has not resulted in a
quickly and accurately assimilated body of information. Some efforts have
been made to integrate displays from the pilot's viewpoint as evidenced by
electronic attitude director indicators and cathode ray tube (CRT) map
displays. However, with these displays, full advantage has not yet been taken
of the potential of current computer and display device technology. This paper
describes an initial three-display configuration, based on a rationale
developed earlier, that is designed to present flight status information.
Emphasis is on the factors that contribute to efficient perception and
integration of the pilot's total spatial situation. The display system
consists of three orthogonal views containing both qualitative and
quantitative information presented on three 17.7 × 17.7 cm (7 × 7 in.)
beam-penetration color CRTs. Therefore, the displays will show different views
of the same information. In addition, to increase the visual separation of the
information elements the colors red, green, and yellow are used to depict,
respectively, the control, performance, and navigation categories of flight
instrumentation. As a result the displays are coordinated in information and
color; therefore, the name Coordinated Cockpit Display (CCD).

INTRODUCTION 


The Man-Machine Integration Branch, Flight Management Program at NASA-
Ames Research Center is committed to study and research of pilot procedures
and pilot-systems interfaces that will be required for aircraft operating
in the National Airspace System of the 1980-1990s (ref. 1). The study of
pilot information and display requirements is an integral part of this Program.
These requirements are generated by many complex systems ranging in scope from
the overall National Airspace System to individual on-board avionics systems.
If the pilot is to cope efficiently with all relevant information, careful
thought must be given to the method of its presentation. In a separate report
(ref. 2) several system and perceptual problems were discussed and the
rationale for an initial display set was evolved. Three general design goals
resulted from that discussion:

1. Divorce the display configuration from individual subsystems.

2. Display information to the pilot in such a manner that he always uses
the same displays regardless of the role he is actively taking.

3. Provide pictorial representations of the aircraft situation with
quantitative information appropriately related to the picture.

Based on these concepts the first prototype system was designed as
described in this report. The display system is a set of three beam-
penetration color cathode-ray tubes (CRTs). Since one of three orthogonal
projections of the aircraft situation will appear on each CRT, the displays
will show different views of the same information. The color feature is
included primarily to obtain visual separation of information elements, but
additional advantage is taken of this capability by differentiating control,
performance, and navigation information on the three displays with red, green,
and yellow, respectively. Therefore, the displays are coordinated in
information and color, and the name Coordinated Cockpit Display (CCD) was
chosen to emphasize this feature of the system. Changes in internal detail,
but not in overall concept, can be expected in subsequent displays.

GENERAL  FEATURES  OF  THE  CCD 


The three-display configuration described here is based on three
orthogonal projections of the aircraft situation: (1) perpendicular to the
pilot's forward line-of-sight; (2) parallel to the ground; and (3)
perpendicular to the other two. Figure 1 illustrates the relationships.

The first display is most closely related to the pilot's view out the
front window and is perpendicular to the earth. For the CCD system this is
called the Vertical Situation Display (VSD) (fig. 1). Because the frame of
reference moves in response to aircraft attitude the first CRT presentations
of this type were called Electronic Attitude Director Indicators (EADI) and
that designation has remained (ref. 3). This reference to attitude is too
restrictive and the term vertical situation display is currently more
descriptive of the broader function visualized for this display.

The second display represents the horizontal situation and is thus called
the Horizontal Situation Display (HSD). This plane is parallel to the earth's
surface and is the plane in which maps are commonly drawn.

The display that will show the pilot's situation in a plane perpendicular
to the earth and parallel to the pilot's forward line of sight is called the
Side Vertical Situation Display (SVSD). In the past, very little attention
has been given to this view of the flight situation. In addition to other
features to be described, this pictorial view will be ideal for explicitly
showing the altitude situation, and should make it easy for the pilot to
maintain altitude awareness.

The combination of these three displays unambiguously shows the total
flight situation. Each display explicitly represents two dimensions in space
and shares one of those two dimensions with each of the other two displays.
For example, the display elements to be shown on the SVSD will represent
up/down and fore/aft situation information; the up/down dimension is also one
of the VSD dimensions, and the fore/aft dimension is one of the HSD dimensions.
Therefore, each of the three displays is capable of showing different views of
the same information; for example, a waypoint in space with a line joining the
aircraft with the waypoint. By constructing these different views of selected
information the displays are tied together, or coordinated, in terms of
information content. The display elements are also to be color coded
according to three (perhaps four) classes of function that will be the same on
all three displays.
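The sharing of dimensions among the three displays can be sketched as a small lookup. The axis labels follow the text's up/down and fore/aft wording; "left/right" is an assumed label for the remaining axis, and the waypoint coordinates are hypothetical.

```python
from itertools import combinations

# Axes explicitly represented by each display.
DISPLAY_AXES = {
    "VSD": ("left/right", "up/down"),
    "HSD": ("left/right", "fore/aft"),
    "SVSD": ("fore/aft", "up/down"),
}


def project(point, display):
    """Return the two coordinates of a 3-D point that a display shows."""
    return tuple(point[axis] for axis in DISPLAY_AXES[display])


# For every pair of displays, find the axes they have in common.
shared = {
    (a, b): set(DISPLAY_AXES[a]) & set(DISPLAY_AXES[b])
    for a, b in combinations(DISPLAY_AXES, 2)
}

# Hypothetical waypoint relative to the aircraft, in arbitrary units.
waypoint = {"left/right": -1.0, "up/down": 0.5, "fore/aft": 8.0}
views = {name: project(waypoint, name) for name in DISPLAY_AXES}
```

Every pair of displays shares exactly one axis, so the same waypoint appears on all three displays with one coordinate in common between any two of them.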

As prime instruments, the three CRTs will be mounted in the center of the
aircraft instrument panel with the display surfaces perpendicular to the
pilot's line-of-sight. This is not ideal since the pilot will have to
mentally rotate coordinates to correspond with the real world. However, the
alternatives of positioning the scope faces parallel to the planes they
represent, either at the instrument panel or closer to the pilot, present
major practical difficulties. To keep this mental rotation task as simple as
possible the three displays will be positioned as shown in figure 2. This is
the relationship that results if the three planes depicted in figure 1 were
folded outward as if they were three sides of a box.


Color  Coding 


As already mentioned in the Introduction, color will be used as part of
this display system, primarily to obtain visual separation of the information
elements. The usefulness of different colors to separate display elements is
well demonstrated by current mechanical flight directors. Because monochrome
CRTs lack color separation, they become visually cluttered by even a few
elements. Shape, intensity, and line coding do little to relieve the problem.
The beam-penetration CRTs to be used with the initial display system can
generate three basic colors: red, green and yellow. (Other intermediate
colors, such as orange, can also be generated, but red, green, and yellow are
the most easily discriminable.) Rather than arbitrarily assigning a color to
each display element, a search was made for some consistent color assignment
scheme that would also fulfill the visual separation requirement. An
instrument classification scheme used by the Air Force provides three
categories to match the three basic colors.


Air Force Manual 51-37 divides flight instruments into three categories:
control, performance, and navigation instruments (ref. 4). Loosely defined,
the control instruments indicate first response to control inputs such as
aircraft attitude and engine power; the performance instruments indicate the
effects of changes in the control parameters, such as pitch changes resulting
in altitude and airspeed changes; and the navigation instruments indicate
aircraft position relative to ground references. These three categories can
also be referred to as inner, middle, and outer loop control.

The colors red, green, and yellow have been assigned to control,
performance, and navigation information, respectively. (This is probably not
critical from a perceptual standpoint.) Red was chosen for control
information for three reasons: (1) pilot response to control requirements
must be relatively quick and red is traditionally associated with a
requirement for immediate action; (2) there are fewer elements of control
information than is the case for performance and navigation, so there is less
demand to "look at red"; and (3) because red elements will probably require
two beam tracings to attain the desired brightness level, assigning fewer
elements to red will save computer time. The present green and yellow
assignments were given because early color drawings of potential displays
were aesthetically more pleasing to the writer.
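The assignment scheme amounts to a two-step lookup: each element belongs to an instrument category, and each category has one color. The sketch below lists only a few elements whose colors are stated in the text; the dictionary and function names are hypothetical.

```python
CATEGORY_COLOR = {"control": "red", "performance": "green", "navigation": "yellow"}

# A few elements whose category follows from the colors given in the text.
ELEMENT_CATEGORY = {
    "aircraft symbol": "control",
    "horizon line": "control",
    "altitude digital readout": "performance",
    "desired vertical track": "navigation",
}


def element_color(element):
    """Look up an element's color through its instrument category."""
    return CATEGORY_COLOR[ELEMENT_CATEGORY[element]]
```

Keeping the category as the intermediate step means that a change to the color of one category, for example if a fourth class of function is added, does not require editing every element.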


Research  Hardware 

The lines and dots that make up the display elements are generated by an
Evans and Sutherland LDS-2, modified to drive beam-penetration color CRTs.
Each color CRT measures 17.7 × 17.7 cm (7 × 7 in.). An SEL-840 computer
interfaces with the LDS-2 to generate aircraft dynamics, navigation and
guidance equations, and performance recording.


FEATURES  OF  THE  INDIVIDUAL  CCD  DISPLAYS 


The CCD concept as outlined so far is quite simple. However, when the
amount of specific information that could go on each display is considered,
along with the different possible forms that could be given to each piece of
information, it is clear that the implementation of CCD could become complex.
In the following description of the individual CCD displays only one form of
selected information is described. It is to be understood that changes will
be made to accommodate the requirements of specific experiments, and the
purpose of these experiments will be to seek better forms of the displays.


Vertical  Situation  Display 


The Vertical Situation Display is the primary display for aircraft
attitude. Since everything is referenced to the direction of flight, the
center of the display can easily become overly cluttered with aircraft symbol,
horizon line, pitch marks, runway symbol and other aiming points. For this
reason everything that might logically go on this display cannot be
accommodated at the same time. One configuration of the VSD is shown in
figure 3. Element color assignments given in the text below are summarized in
Table 1.

This method of showing the attitude situation is fairly standard. The
combination of aircraft symbol (fixed), horizon line, and roll angle marker
show a 10° left bank and zero degree pitch angle. These elements will all be
shown in red.

The ground plane is differentiated from the skyplane by a perspective dot
pattern. The rate of downward motion could be programmed to be a function of
forward velocity and/or altitude and may be studied at a later time. [...]
believed, however, that the most important function of these dots is the
ground plane/skyplane differentiation (ref. 5) and secondarily the general
[...] (ref. 6). Altitude and velocity coding would encounter range [...]; for
example, the dots would be so far apart close to the ground that the visual
illusion of the ground plane would be lost. The ground plane dots will be
yellow.

The performance information that will have to be read most precisely
during critical maneuvers surrounds the central attitude display. The altitude
[...] right has a natural up/down relationship on this display. [...] at the
top of the display has a natural right/left relationship. Airspeed has no
natural position correspondence so the standard population stereotype,
reading upward for larger values, was adopted (ref. 7). Heading, altitude, and
airspeed are each read as a combination [...] and digital readout, taking
advantage of the best features of both. Digital readout can be read more
quickly and accurately than an analogue [...] rate judgments. A moving tape
provides rate and lead information. In operation, the moving tape numbers are
blanked from the digital readout box so that the visual effect is that of the
tape moving behind the box. For this simulation there will be provision for
choosing either moving tape or digital readout separately before beginning a
flight.

The rate of change of heading and altitude, more commonly known as turn
rate and instantaneous vertical speed indication (IVSI), respectively, are
displayed [...] shown in terms of a standard 3°/sec [...]; the IVSI will be
scaled for one [...] probably need [...]. If needed, a speed command or error
bug [...]. [...] left, top, and right of the VSD will be green.

Two pieces of information, flight path angle (FPA) and potential flight
path angle (PFPA), have been combined into one symbol so that the relationship
of these two pieces of information cannot be lost among other symbols on
[...] called the aiming point, and a line extending from the aircraft toward
this point is called the [...] vector. This symbol can be used to show flight
[...] to any spatially located point such as a [...] are [...] pilot
interpretation problems with different simulated weather conditions. Green is
the color from the inner tips of the symbol to the pivot or bending point.

The PFPA is referenced to the FPA. When the PFPA is level with the FPA,
the [acceleration along the] flight path is equal to zero, therefore [speed
is] constant. If PFPA is above FPA the acceleration is positive and the speed
will increase; if PFPA is below the FPA, acceleration is negative and speed
[will decrease]. These two indicators make the effect of changes in [...]
setting, flaps, landing gear, etc., immediately apparent to the pilot. [...]
of the use of these two [...] example shown in figure 3. The potential flight
path is shown as being 8° below current flight path. Without a power change,
a continuation of the flight path shown will lead to a reduction in airspeed.
The pilot can use the information to increase throttle until the potential
flight path reads the same value as for flight path, thereby maintaining
current flight path and airspeed. Or, continuing with the example in figure
3, the pilot can maintain current airspeed without changing thrust by pitching
down until the flight path matches the potential flight path. Because
potential flight path is a directly controlled variable, the "flat" portion
of the symbol is in red.
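The reading rule for the combined symbol can be sketched as a simple comparison of the two angles. The function name, the tolerance, and the flight path angle used in the example are assumptions; only the 8° difference comes from the figure 3 example.

```python
def speed_trend(fpa_deg, pfpa_deg, tolerance_deg=0.1):
    """Classify the airspeed trend from FPA and PFPA, both in degrees."""
    difference = pfpa_deg - fpa_deg
    if abs(difference) <= tolerance_deg:
        return "constant"
    return "increasing" if difference > 0 else "decreasing"


# Figure 3 example: potential flight path 8 degrees below current flight path.
# The flight path angle of -3.0 is an assumed value; only the difference matters.
trend = speed_trend(fpa_deg=-3.0, pfpa_deg=-11.0)
```

In this example the trend is "decreasing", which matches the text: without a power change, airspeed will fall until the pilot either adds throttle or pitches down to bring the two angles together.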

Not  shown  in  figure  3,  but  planned  for  evaluation  are  waypoint  guidance, 
runway  and  touchdown  point,  and  a  method  for  showing  a  3-D  perspective  of 
desired  flight  path;  for  example,  tunnel  or  channel  display  (ref.  8). 


Side  Vertical  Situation  Display 

The Side Vertical Situation Display clearly and unambiguously relates
present aircraft altitude to future altitude requirements (fig. 4). The
aircraft symbol (red) remains fixed at the altitude digital readout box
(green). This accomplishes two purposes: (1) the aircraft altitude reference
is explicitly established and (2) a required second altimeter is provided. The
altitude on the VSD is from radar and the altitude on the SVSD is barometric.
The operation of the moving tape/digital readout is the same as described for
the VSD. To enhance terrain altitude awareness, significant terrain features
(yellow) can be shown referenced to the moving tape. Logic will have to be
provided to change these features as a function of lateral displacement from
desired ground track.

Flight  path  angle  (green)  and  potential  flight  path  angle  (red)  are 
accurately  read  against  an  expanded  angle  scale  (3:1  in  fig.  4).  The  vertical 
relationships  are  the  same  as  previously  explained  for  the  VSD.  The  aircraft 
symbol  rotates  about  its  midpoint  to  indicate  aircraft  attitude. 

An IVSI digital readout (green) in the upper left corner supplies
absolute vertical speed information, supplementing the analogue readout on the
VSD. An arrow appearing above or below the box reinforces the sign
information regarding up or down velocity of the aircraft. There is a ±50
ft/min dead band about zero ft/min so that the arrow is not continually
flipping over when the aircraft is flying straight and level. In keeping with
the philosophy of relating quantitative information to qualitative information
this vertical speed information should be closely related to the flight path
angle or the aircraft symbol. Initial attempts to do so resulted in excessive
clutter and loss of other information. As with all items on these displays
its final form is yet undetermined.
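The dead band on the IVSI arrow can be sketched as follows. The text gives only the ±50 ft/min band; holding the previous arrow state inside the band is an assumption made for this illustration (hiding the arrow would be an equally plausible reading), and the function name and sample values are hypothetical.

```python
DEAD_BAND_FT_PER_MIN = 50.0


def arrow_state(vertical_speed, previous="none"):
    """Return 'up', 'down', or the previous state inside the dead band."""
    if vertical_speed > DEAD_BAND_FT_PER_MIN:
        return "up"
    if vertical_speed < -DEAD_BAND_FT_PER_MIN:
        return "down"
    # Inside the dead band: keep the last state so the arrow does not flip.
    return previous


state = "none"
history = []
for vs in [0.0, 30.0, 120.0, 20.0, -40.0, -300.0, -10.0]:
    state = arrow_state(vs, state)
    history.append(state)
```

Small fluctuations about level flight (here 20 and -40 ft/min) leave the arrow unchanged; it flips only when the vertical speed leaves the band on the other side.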

A  segmented  line  (yellow),  moving  toward  the  aircraft  symbol,  indicates 
the  desired  vertical  track.  Relevant  tags  are  shown  at  waypoints,  marker 
beacons,  etc.  Vertical  and  horizontal  scaling  must  be  compatible  with  the 
flight  path  angle  scaling. 
Horizontal Situation Display

The Horizontal Situation Display (fig. 5) relates the aircraft (red) to
its geographic position. This may be shown as aircraft position relative to a
desired course line, navigation aids, waypoints, runways, or prominent
geographic features, all of which would be shown in yellow.

The horizontal projection of the velocity vector or flight path (called a
trend vector by Boeing), the range altitude symbol, and ground speed and
windspeed vectors would be green. The range altitude symbol shows the point at
which the next waypoint altitude will be reached if the present vertical
component of the velocity vector is maintained.
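The placement of the range altitude symbol follows from simple kinematics: the time to reach the target altitude at the present vertical speed, multiplied by ground speed. The sketch below is an illustration only; the function name, units, and example values are assumptions, and wind and turns are ignored.

```python
def range_to_altitude(altitude_ft, target_altitude_ft, vertical_speed_fpm, ground_speed_kt):
    """Distance (nautical miles) at which the target altitude is reached, or None."""
    altitude_error = target_altitude_ft - altitude_ft
    if altitude_error == 0:
        return 0.0
    # No crossing point if the aircraft is level or moving away from the target.
    if vertical_speed_fpm == 0 or (altitude_error > 0) != (vertical_speed_fpm > 0):
        return None
    minutes = altitude_error / vertical_speed_fpm
    return ground_speed_kt * minutes / 60.0


# Hypothetical descent from 5000 ft to 3000 ft at 1000 ft/min and 120 kt.
distance = range_to_altitude(5000.0, 3000.0, -1000.0, 120.0)
```

In this example the descent takes two minutes, so the symbol would be drawn 4 nautical miles ahead along the trend vector; when the aircraft is climbing away from the target altitude no symbol position exists.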

If the display is to be used for manual control, the lateral track error
can be expanded by some factor and shown by a bar parallel to the aircraft, as
if a portion of the guide line had been cut out and expanded.

Sufficient work has been done to show the utility of a predictor on the
HSD (refs. 9, 10) so that an evaluation of a predictor (not shown in fig. 5)
will be part of this work. (This may eventually include evaluation of
predictors on the VSD and SVSD as well.) Also not shown, but candidates for
HSD presentation, are time slot information for 4-D navigation and symbols
showing other aircraft for traffic situation information (refs. 11, 12).


Coordinated  Cockpit  Display  Evaluation 

The  goal  of  the  CCD  concept  is  to  present  flight  information  explicitly 
in its situational context. The advantages and disadvantages of this approach
remain  to  be  studied.  In  the  first  simulator  study,  pilots  will  manually  fly 
a  complex,  decelerating  landing  approach  with  go-around  at  200  ft  before 
touchdown.  Using  this  task,  pilot  performance  will  be  compared  when  using  the 
CCD  or  standard  instruments.  Pilots  will  be  interviewed  for  opinions,  com¬ 
ments,  suggested  changes,  and  additions  or  deletions. 

As  display  ideas  evolve  it  is  expected  that  various  configurations  of  the 
CCD  will  be  compared  so  that  new  ideas  on  display  content  and  form  can  be 
evaluated.  In  a  parallel  effort  the  CCD  will  also  be  integrated  into  a  full 
mission simulation and evaluated in the larger context of complex navigation
with an air traffic control system.


TABLE 1.- Display Element Color Assignments


Display  elements 


Yellow 


Horizon  line  and  pitch  marks 

Aircraft  symbol 

Roll  angle 

Ground  plane  dots 

Altitude  tape  and  digital  readout 

Airspeed  tape  and  digital  readout 

Heading  tape  and  digital  readout 

Turn  rate 

IVSI 

Flight  path  (FPA) 

Potential  flight  path  (PFPA) 
Waypoint  guidance  (not  shown) 
Runway  (not  shown) 

Tunnel  or  channel  (not  shown) 

SVSD
Aircraft  symbol 

Altitude  tape  and  digital  readout 
Terrain  features  (not  shown) 
Flight  path  (FPA) 

Potential  flight  path  (PFPA) 

Angle  scale 
IVSI 

Desired  vertical  track 
Waypoints, beacons, etc.

HSD

Aircraft  symbol 
Flight  path 
Range  altitude 
Ground/windspeed  vectors 
Desired course line

Expanded  error  bar 
Navigation  aid 
Waypoint 
Runway 

Obstructions 


X 

X 

X 

X 

X 

X 

X 

X 

X 

X 

XXX  XX 
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of  aircraft  situation. 
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SUMMARY 


A  mixture  of  aircraft  (A/C)  with  and  without  traffic  situation 
displays  (TSD)  was  simulated  to  ascertain  its  effects  on  distributed 
air  traffic  management  in  the  terminal  area. 

A  particular  type  of  distributed  air  traffic  management  (sequencing) 
was  used  as  determined  from  previous  experiments  with  the  nonTSD 
aircraft  being  vectored.  This  mixed  condition  consisting  of  one  A/C 
simulator  without  a  TSD  and  two  with  TSD  was  compared  to  a  baseline 
vectoring  condition  in  which  all  3  simulator  A/C  lacked  TSD. 

The  three  simulator  A/C  and  four  or  five  computer  simulated  A/C 
were  embedded  in  a  terminal  area  traffic  problem  with  as  much  realism 
as possible. All A/C were considered to be STOL craft.

Analyses  were  made  of  flight  performance  measures,  verbal  communica¬ 
tions  and  subjective  evaluations  by  the  professional  pilots  and  controllers 
who  served  as  subjects. 


The analyses favor the TSD equipped A/C and the distributed mode of
management permitted by this cockpit capability. However, there are
indications that an A/C without a TSD in a TSD environment may require or
receive considerably more controller attention and pilot disfavor than
when it is in an all vectored environment. This may imply that TSD and
nonTSD A/C should be segregated and controlled accordingly.
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INTRODUCTION 


[...] experiments and reports (1, 2, 3) have now documented
the potential that Traffic Situation Displays (TSD) in the
cockpit [...] increasing the safety, orderliness and expeditiousness
[...] traffic management without increasing pilot or
[...] their TSD without need for any vectoring [...]

However, the experiments to date have not treated the very reasonable
possibility of a mixed traffic [...] in which some A/C have
TSD [...]. This situation could arise by failure
of the TSD in an A/C. It could also arise through the more common
possibility that not all A/C will [...] other evolutionary
factors thus producing a [...] A/C on a traffic information basis.
[...] special management problems and
[...] of A/C
might have its own impacts on efficient management.

In order to investigate the [...]
simulation facilities and programs developed in the Man-Machine Integration
Branch. This facility permits studying the group interactive aspects
and performance of terminal area traffic management based on exploitation
of the 360° view of surrounding traffic made available in the cockpit
by a Traffic Situation Display.

A previous study showed that distributed management of 3 A/C
with TSD was not only possible but also generally superior [...]
centralized vectoring method of control.
In particular, a form of distributed management termed sequencing appeared
superior to an alternative form of distributed management. In the
sequencing mode, the controller issued only sequence order to each TSD
equipped A/C [...] achieve it in VFR fashion, thus vastly
reducing controller load without any substantial increase in [...].
[...] for this study, the sequencing mode of
[...] for the TSD equipped A/C. All nonTSD
A/C were vectored.

[...] an independent variable along a continuum with the
ratio possibly determining the best form of traffic management. The
mixed ("nonvectoring") condition was thus a simultaneous VFR-IFR
environment.


The  two  basic  conditions  (vectoring,  nonvectoring)  were  compared 
and  within  the  nonvectoring  (mixed)  condition,  the  two  specific  manage¬ 
ment  techniques  (Sequencing,  Vectoring)  were  further  compared.  Comparisons 
were  based  on  flight  performance  measures,  verbal  communications  and 
subjective  evaluations  obtained  after  each  run  and  after  the  experiment. 
Three  groups  of  three  pilots  and  two  controllers  per  group  were  used  to 
test  for  group  specific  differences  in  the  results.  All  subjects  were 
practicing  professionals. 


METHOD 


1.  Task 

The  basic  task  is  referenced  to  the  map  shown  in  Figure  1. 


Figure  1.  Ground  Projection  of  the  Task  Layout.  Distances  are  given 
in  Nautical  Miles.  A  STOL  Terminal  Area  is  simulated. 

The Locations of the Outer Marker (LOM) and Middle Marker (MM) were
appropriate for a STOL airport. There were three approaches to the airfield
and a go-around (goa) route to be used in case of a missed approach.
Suggested altitudes are given at each waypoint (WP) for traffic purposes
although once past WP 12, the altitudes shown are appropriate for a 6°
STOL approach which all craft were required to execute.

Computer  generated  A/C  were  introduced  at  WP  11  every  two  minutes 
on  the  average  with  a  5  second  standard  deviation.  Once  introduced, 
each  A/C  followed  a  standard  altitude  and  speed  profile  appropriate 
for  its  type.  The  controllers  could  modify  only  the  speed  of  these  craft 
through keyboard input. Controllers could "hold" the computer A/C at
WP  11  and  subsequently  resume  their  introduction  through  appropriate 
keyboard  commands. 
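
The introduction schedule for the computer generated A/C can be sketched as follows. This is only an illustration: the text gives the mean (two minutes) and the standard deviation (5 seconds) of the introduction interval, while the normal distribution, the accumulation of successive intervals, and the function name `arrival_times` are assumptions made here.

```python
import random

def arrival_times(n_aircraft, mean_s=120.0, sd_s=5.0, seed=None):
    """Return cumulative introduction times (seconds) at WP 11.

    Assumption: successive intervals are independent and normally
    distributed; the source states only the mean and standard deviation.
    """
    rng = random.Random(seed)
    t = 0.0
    times = []
    for _ in range(n_aircraft):
        # Guard against a (very unlikely) negative interval.
        t += max(0.0, rng.gauss(mean_s, sd_s))
        times.append(t)
    return times
```

A "hold" at WP 11 could be modeled by suspending this schedule and resuming it later, although the keyboard commands themselves are not described in the text.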

The  piloted  simulator  A/C  were  initially  introduced  anywhere  between 
or  on  the  045°  and  165°  approaches  to  the  terminal  area  with  appropriate 
headings  and  speeds.  Pilots  had  aileron,  elevator  and  throttle  control 
of  their  craft. 

The basic task required that the 3 simulator craft be inserted
between the computer A/C (scheduled to cross WP 12 every two minutes
with some variation). A basic rule limited spacing to a minimum of 1 nm
or 60 seconds between A/C crossing the middle marker. (A fuller description
of the problem and subject instructions is given in reference 5.)

Two  management  conditions  for  accomplishing  this  task  were  compared. 

In  the  first  condition  (vectoring)  all  3  A/C  were  without  TSD.  Each  pilot 
saw  the  map  of  Figure  1  and  his  own  A/C  position  on  it.  Because  no 
visual  traffic  information  was  available  to  the  pilots  they  had  to  be 
vectored.  In  the  second  condition  (nonvectoring),  two  of  the  three 
simulators  had  full  360°  visual  traffic  information  while  the  third  remained 
as  in  the  first  condition  with  the  map  and  only  his  own  position  shown  on 
it. Thus two of these A/C (with TSD) could operate in the sequencing mode
while  the  third  A/C  (without  TSD)  required  vectoring.  A  common  voice 
circuit  was  used  between  all  pilots  and  controllers.  Each  pilot  had  a 
vertical  situation  display  identical  to  that  used  in  previous  studies  and 
a  description  may  be  found  in  Reference  2.  A  30  second  path  predictor 
was  always  present  on  own  A/C  only. 

The  basic  objective  of  this  study  was  to  determine  the  impact  such 
a heterogeneous mixture of A/C with and without traffic information would
have  on  terminal  area  traffic  management  when  compared  with  a  baseline 
vectoring  condition. 

2.  Measures 

Three types of measures were obtained in this experiment: objective
(flight performance) measures of variables such as aileron, elevator and
throttle activity in the simulators, final airspeed, heading and glideslope
errors, pitch and roll rates and others; verbal measures based on
tape recordings of the pilot and controller exchanges; and subjective
measures obtained from questionnaires completed after each run and after
the total experiment.

The  experiment  was  replicated  on  three  separate  groups  of  three 
pilots  and  two  controllers  per  group.  Three  runs  of  each  of  the  two 
experimental  conditions  were  obtained  from  each  group.  A  run  lasted 
20  to  25  minutes  and  each  group  received  approximately  4  hours  of  practice 
under  both  conditions.  Practice  and  test  runs  were  made  on  separate 
days.  All  15  subjects  were  current  airline  pilots  or  air  traffic  controllers 
from  the  San  Francisco-Oakland  region. 

To  reiterate,  in  the  Vectoring  condition,  no  A/C  had  a  TSD  and  thus 
all  A/C  required  vectoring.  In  the  Nonvectoring  condition,  the  two  A/C 
with  traffic  displays  received  sequence  order  information  only  from  the 
controller  and  were  left  to  achieve  that  order  via  their  TSD  and  any  verbal 
communication  necessary  between  the  pilots.  The  single  A/C  without  a 
TSD  had  to  be  vectored  by  the  controller  among  the  A/C  flying  "VFR." 

Thus  the  nonvectoring  condition  had  both  VFR  (TSD  equipped  A/C)  and  IFR 
(nonTSD  equipped  A/C)  traffic. 


RESULTS 


The  results  are  highlighted  here  with  more  detailed  analyses  to 
be  found  in  reference  5.  Results  are  presented  according  to  the  three 
types  of  measures. 

1.  Objective  Measures 


A.  Comparisons 


A  battery  of  seventeen  objective  performance  measures  was  recorded 
on  each  of  the  3  piloted  simulated  aircraft.  These  variables  are  not 
statistically  independent  of  each  other.  However,  an  assessment  of  the 
number  of  independent  components  present  in  the  battery  will  depend,  to 
some extent, on the results of the present analyses. Application of principal
components and factor analytic procedures is therefore postponed to a
later report. Sixteen of the variables may be thought of as belonging
to four general groups as follows:


Group 1 - Landing Accuracy
FAE: final airspeed error
FGE: final glideslope error
FHE: final heading error
FLE: final lateral error


Group 2 - Manual Workload
AA: aileron activity
EA: elevator activity
TM: throttle movements
TM/DT: throttle movements per unit of flight time


Group 3 - Expedience/Economy
HPL: horizontal path length
DT: duration of flight
STM: speed total maneuvering
ICT: intercrossing time


Group 4 - Passenger Comfort
RRA: roll rms acceleration
PRA: pitch rms acceleration
TAC: throttle average change
STM/DT: speed total maneuvering per unit of flight time


All  measures  in  the  Landing  Accuracy  group  were  made  as  the  aircraft 
passed  the  middle  marker  (MM  on  Figure  1)  during  the  final  approach. 

In  the  second  group,  the  aileron  and  elevator  activities  are  rms 
values  of  these  controls  during  the  simulated  flight.  Throttle  movements 
(TM) which resulted in speed changes of five knots or more were counted
for  this  variable.  The  variable  TM/DT  is  the  throttle  movement  count 
divided  by  the  duration  of  the  flight. 

The  third  group  of  output  variables  includes  the  total  length  of 
the  aircraft  ground  track  (HPL)  and  the  time  duration  of  the  simulated 
flight (DT). These two variables are highly correlated. Speed total
maneuvering (STM) is the sum of all speed changes during the flight.
Intercrossing time (ICT) is the time interval between a given aircraft
and the one ahead of it crossing the middle marker on the final
approach. (Subjects were instructed to maintain an intercrossing time
as close to sixty seconds as possible.) The lowest mean ICT for any of
the ATC management schemes was in excess of the sixty second guideline.
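
The intercrossing time measure can be sketched as follows. This is a minimal illustration, assuming that the middle marker crossing time of each aircraft is available in seconds; the function names `intercrossing_times` and `spacing_violations` are hypothetical, and the sixty second minimum is taken from the spacing rule stated in the task description.

```python
def intercrossing_times(crossing_times_s):
    """Return the time gaps (s) between successive middle marker crossings."""
    ordered = sorted(crossing_times_s)
    return [later - earlier for earlier, later in zip(ordered, ordered[1:])]

def spacing_violations(crossing_times_s, minimum_s=60.0):
    """Return the intercrossing times that fall below the minimum spacing."""
    return [ict for ict in intercrossing_times(crossing_times_s) if ict < minimum_s]
```

For crossings at 0, 65, 120 and 200 seconds the intercrossing times would be 65, 55 and 80 seconds, and only the 55 second gap would violate the spacing rule.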


The  final  group  of  output  variables  describes  the  aircraft  motion 
as  it  may  affect  passenger  comfort.  The  first  two  variables  are  the  rms 
values of roll and pitch accelerations. The average throttle change (TAC)
was computed using only throttle changes which would result in speed changes
in excess of five knots and, therefore, reflects the more gross and perceptible
speed changes. The speed total maneuvering was divided by the flight
duration to give the last variable in this group, STM/DT.


For  each  of  these  sixteen  variables,  the  lowest  value  is  also  the 
most  preferred  value.  A  seventeenth  output  variable  is  the  average  speed 
of  the  aircraft  (AVSPD).  There  is  no  obvious  optimal  value  for  average 
speed  and  since  there  was  no  significant  difference  in  this  variable  among 
the  several  ATC  management  schemes,  it  was  not  included  in  the 
summary  of  results  to  follow. 


An a priori summary of the objective performance results is
presented in Figure 2, showing four ATC management categories. The first
category, vectoring, includes 44 completed flights. The cockpits of the
simulators were equipped with displays showing the position of that
aircraft only in relation to the map. The next three categories
involve nonvectoring condition flights. The first of these included
35 completed flights in which the simulators were equipped with traffic
situation displays showing the positions of all airborne aircraft
to the pilots. The second of the nonvectoring categories included
the 16 vectored flights of simulators with no TSDs. Experimental runs
in the nonvectoring condition were heterogeneous since simulators with
and without TSDs were in the air at the same time in a ratio of about 2:1.

The last category of the nonvectored condition flights included all 51 flights
of the two previous categories (both with and without TSDs). This category
reflects the overall performance of the two-to-one mix of TSD and nonTSD
aircraft for comparison with the totally vectored condition flights.


[Figure 2 panels: ATC management profiles; mean values; standard deviations]

Figure 2. A Relative Comparison of the Means and Standard Deviations
of the 16 Objective Measures for Four Types of Flights.
Percentages shown are Relative to the lowest Value which
is not shown.
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When  a  flight  involved  a  missed  approach  or  "go  around",  the 

[...] the middle marker [...] analysis. [...] was omitted since the high
throttle, aileron, [...] activities and consequent aircraft accelerations
associated with recovering from the missed approach tended to produce markedly
bimodal distributions for these variables.

The profiles of Figure 2 compare the mean values and standard
deviations of the first sixteen output variables. The ATC management
category [...] highest mean or standard deviation for a given
[...] case for that variable. The vertical bars
indicate the per cent improvement achieved by the given management schemes
[...] variable. For instance, nonvectored TSD [...]
nonvectored condition [...] 35 per cent improvement over the TSD flights
[...] the combined result for all nonvectored condition
flights was about 10 per cent better than the TSD category alone.
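
The per cent improvement plotted in the profiles can be sketched as follows. This is an interpretation rather than the authors' stated procedure: it assumes that, for each measure, the category with the highest mean (or standard deviation) is the worst case and that every other category is expressed as a percentage reduction relative to it. The function name `percent_improvement` and the numbers in the example are made up for illustration only.

```python
def percent_improvement(category_values):
    """Map each ATC management category to its per cent improvement over
    the worst (highest) value of one measure; lower raw values are better."""
    worst = max(category_values.values())
    if worst == 0:
        return {name: 0.0 for name in category_values}
    return {name: 100.0 * (worst - value) / worst
            for name, value in category_values.items()}

# Hypothetical values of a single measure for three categories.
example = percent_improvement({"vectoring": 2.0, "TSD": 1.0, "nonTSD": 1.5})
```

Under this reading the worst category always scores zero, which would be consistent with a worst case that is not drawn in the figure.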

Figure 2 shows that the vectoring condition produces poor results
in the final error measures both with respect to mean values and standard
deviations. Vectoring also resulted in consistent but poor performance
with respect to aileron activity. However, the remainder of the manual
workload measures, as well as the expediency/economy measures and passenger
comfort measures [...] the respective worst cases.
The nonvectored TSD flights produced poor results in final airspeed and
final lateral errors, [...] glideslope and heading errors were low. This
category [...] with respect to aileron activity and
consequent roll acceleration while requiring the greatest amount of elevator
activity and pitch acceleration. [...] measures seem to
follow the vectoring profile. [...] condition runs without
[...] outperformed all other
categories on the final error measures. However, this category produced
the worst results for almost all other measures.

[...] set of pair-wise comparisons of the various ATC
management categories [...] comparing flights on the 16 measures.
The first comparison contrasts [...] vectoring with the combined nonvectoring
category (i.e., no distinction [...]). Nonvectoring appears
[...] error measures while the reverse is
[...] the other measures. The aileron activity favored
vectoring [...].
F tests showed the heading and [...] variance in aileron
[...] for the vectoring flights however.
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PAIR  WISE  COMPARISON  OF  ATC  MANAGEMENT  CATEGORIES 

[Figure 3 panels: mean values; standard deviations]

Figure 3. Means and Standard Deviations of the 16 Objective Measures
Compared for the Four Types of Flights on a Pair-Wise Basis.
The Favoring Direction of a Measure is Indicated. Levels
of Statistical Significance are Shown.

Comparison  of  the  vectoring  and  nonvectoring  (TSD)  flights  generally 
favored the latter category. The nonvectored flights had significantly
lower  aileron  activity  and  consequent  roll  acceleration.  The  variance 
of  FHE,  FLE,  DT,  TAC,  and  STM/DT  was  significantly  less  for  the 
nonvectored  TSD  flights  while  the  variance  for  aileron  activity  significantly 
favored  the  vectored  aircraft. 

Comparison  of  vectoring  condition  flights  with  the  nonvectored 
condition  non  TSD  (i.e.  vectored)  flights  showed  the  latter  to  perform 
better  with  respect  to  final  error  measures  and  aileron  and  elevator 
activity,  while  vectoring  was  favored  on  most  other  measures.  No  comparison 
of  mean  values  between  these  categories  was  significant.  The  nonTSD  vectored 
flights  in  the  nonvectoring  condition  produced  less  variance  in  all  of  the 
final  error  measures  and  three  out  of  four  of  these  comparisons  were 
significant  beyond  the  .01  level.  Vectoring  condition  flights  produced 
lower  variances  in  all  other  measures  although  only  aileron  activity, 
roll  acceleration  and  intercrossing  time  were  significantly  lower. 
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Finally,  the  nonvectored  TSD  flights  are  compared  with  the  nonTSD 
(vectored) flights in the nonvectoring condition. The only significant
differences in mean values show TSD flights to be superior with respect
to aileron activity and roll acceleration. The nonTSD flights have lower
variances in all of the final error measures (two out of four were
significant) while the TSD flights had significantly lower variances in all of
the expedience/economy measures. The variance in throttle average change
also favored the TSD flights.

A brief summary comparing the conditions on the relative favorableness
of each measure (mean and standard deviation) shows the following.

o Vectoring condition superior to nonvectoring condition overall.

Vectoring showed more favorable biases than nonvectoring although the
most significant differences favored nonvectoring as did the final error
measures. Intercrossing times favored vectoring.

o TSD nonvectoring superior to vectoring.

The truly nonvectored flights (those with TSD) compared to
the flights in the vectoring condition showed more favorable biases
(10-to-6 for both means and standard deviations) including
intercrossing times. In addition, the 7 significant differences all favored
the TSD flights.

o Vectoring condition superior to vectored nonTSD.

The flights in the vectoring condition showed more favorable biases
than the vectored flights (nonTSD) in the nonvectoring condition, most
noticeably in the standard deviations (12-to-4) although again the most
significant differences favored the nonvectoring nonTSD flights as did
the final errors. Intercrossing times favored the vectoring condition
although not significantly.

o TSD nonvectoring superior to vectored nonTSD.

Within the nonvectoring condition, the TSD (nonvectored) flights showed
more favorable biases than the nonTSD (vectored) flights including
intercrossing time. However, the final errors were smaller and more consistent
for the nonTSD.

Thus,  based  solely  on  objective  measures,  the  TSD  (nonvectored)  flights 
were  superior  to  vectored  flights  whether  or  not  all  the  flights  in  the 
condition were vectored or just those flights lacking a TSD. On the
other hand, those flights that were vectored in the mixed (nonvectoring)
condition were somewhat better on final errors.

B.  Multidimensional  Analyses 

A  series  of  multivariate  analyses  of  variance  (MANOVA)  and 
discriminant  analyses  were  performed  on  the  data  using  the  ATC  management 
schemes as the independent or grouping variable. Thus, there were three
groups: vectored condition flights, nonvectored condition flights with
TSDs, and nonvectored condition flights without TSDs. The purpose of
these analyses was to determine if there were [...] differences
among the 3 groups in [...] a subset of [...]. If such differences are
present, as a priori analysis [...], variates should express these
differences in a readily interpretable fashion.

The first step in the analysis was to break down the total variance
in the measurement battery of seventeen variables into a partition of
variance which contained the differences among the three groups and a
partition [...]. [...] hypothesis H1, equal [...]
three groups. It was found that this hypothesis could be rejected with
a probability of false rejection less than 0.1.

Inspection of univariate F ratios for the individual measurement
variables [...] probability of false rejection. The
number of variables was selectively and finally reduced in ensuing analyses
to 8.

The eight member battery included the variables HPL, STM, AA, RRA,
FAE, FGE, DT and ICT and enabled rejection of H2 with a probability of
false rejection at less than 0.01. The discriminant analysis performed
on this battery [...] orthogonal components of the
battery, both [...] provided significant discrimination among the groups.
The difference among the three group means on the first component was
[...]. This component is defined in terms
[...] each of the eight variables and is given in standardized
form by the following equation.


D1 = .067 HPL - .328 STM + .532 AA + .372 RRA
     - .323 FAE + .377 FGE + .320 DT + .397 ICT                    (1)


The level of significance of differences among group means on the second
component is between 0.05 and 0.02. This component is given by the
following equation:


D2 = - .025 HPL + .491 STM - 1.299 AA + 1.132 RRA
     - .221 FAE + .109 FGE + .307 DT + .481 ICT                    (2)
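
The two discriminant components can be evaluated for a single flight as in the sketch below. The weights are those printed in equations (1) and (2); the inputs are assumed to be standardized scores on the eight variables, since the equations are stated to be in standardized form. The function name `discriminant_scores` is hypothetical, and the sign of the FGE term in the second component is read here as positive from a partly illegible original.

```python
VARIABLES = ("HPL", "STM", "AA", "RRA", "FAE", "FGE", "DT", "ICT")

# Standardized weights as printed in equations (1) and (2).
D1_WEIGHTS = (0.067, -0.328, 0.532, 0.372, -0.323, 0.377, 0.320, 0.397)
# Assumption: the FGE weight (+0.109) is read as positive.
D2_WEIGHTS = (-0.025, 0.491, -1.299, 1.132, -0.221, 0.109, 0.307, 0.481)

def discriminant_scores(z):
    """Return (D1, D2) for a dict of standardized scores keyed by variable."""
    d1 = sum(w * z[name] for w, name in zip(D1_WEIGHTS, VARIABLES))
    d2 = sum(w * z[name] for w, name in zip(D2_WEIGHTS, VARIABLES))
    return d1, d2
```

A flight that is one standard deviation above the mean in aileron activity and average on everything else would score about +0.53 on the first component and about -1.30 on the second.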
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The  structure  matrix  for  these  components  was  computed  using  the 
total  (before  partitioning)  variance  in  the  test  battery.  This  structure 
matrix,  as  well  as  the  communalities  achieved  by  the  structure  and  the 
univariate  F  ratios  computed  in  the  MANOVA  are  presented  below. 


Variable     Structure          Communalities    Univariate F Ratios
              1        2

HPL          .173     .246          .091               .72
STM         -.056     .312          .101               .70
AA           .709    -.324          .608              6.56
RRA          .602     .129          .379              4.06
FAE         -.147    -.270          .095               .72
FGE          .231    -.226          .104               .89
DT           .217     .298          .136              1.10
ICT          .307     .334          .206              1.176

The  elements  of  the  structure  matrix  are  product  moment  correlations 
between  the  variable  (row)  and  the  discriminant  component  (column). 

The  square  of  an  element  is,  therefore,  the  proportion  of  total  variance 
in  the  variable  that  is  used  by  the  component.  The  communalities  for 
the  variables  are  the  sums  of  squares  across  the  rows  of  the  structure 
and  are,  therefore,  the  proportions  of  total  variance  in  the  variable 
that are utilized in the discriminant space defined by the two components.
The  univariate  F  ratios  and  communalities  both  indicate  similar  orders 
of  importance  for  the  variables  in  this  measurement  battery.  The  aileron 
activity  is  the  most  important  differentiator  of  the  group.  The  other 
variables  are  RRA,  ICT,  DT,  FGE,  STM,  FAE  and  HPL  in  descending  order  of 
importance. 
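
The relation between the structure matrix and the communalities can be checked with a short sketch. The correlations below are the structure values tabulated above; the function names `communalities` and `rank_by_communality` are hypothetical. Each communality is computed, as the text defines it, as the sum of squares across a row of the structure matrix.

```python
# Structure matrix: correlation of each variable with components 1 and 2.
STRUCTURE = {
    "HPL": (0.173, 0.246),
    "STM": (-0.056, 0.312),
    "AA": (0.709, -0.324),
    "RRA": (0.602, 0.129),
    "FAE": (-0.147, -0.270),
    "FGE": (0.231, -0.226),
    "DT": (0.217, 0.298),
    "ICT": (0.307, 0.334),
}

def communalities(structure):
    """Sum of squared correlations across each row of the structure matrix."""
    return {name: sum(r * r for r in row) for name, row in structure.items()}

def rank_by_communality(structure):
    """Variable names ordered from largest to smallest communality."""
    comm = communalities(structure)
    return sorted(comm, key=comm.get, reverse=True)
```

Ranking the variables this way gives AA, RRA, ICT, DT, FGE, STM, FAE, HPL, which agrees with the descending order of importance stated in the text.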

The  mean  group  scores  on  the  two  discriminant  components  are 
plotted in Figure 4. Each group mean is at the center of an ellipse
and  the  ellipses  define  regions  of  the  discriminant  space  containing  about 
67%  of  their  respective  populations.  The  vectors  emanating  from  the  origin 
of  the  space  are  projections  of  the  measurement  vectors  on  the  discriminant 
plane  indicating  their  directions  of  increase  in  the  discriminant  space, 
while  their  lengths  indicate  their  relative  importances  as  discriminators 
of  the  groups. 




The MANOVA and discriminant analyses have established that the
level of significance for differences in the smaller subset of eight
variables is beyond the 0.01 level. At this point it would be appropriate
and informative to perform a principal components analysis and factor
analysis on the data to determine the number of independent components of
variance present in the seventeen output variables and to find a rotation
of the components which is maximally interpretable. Since the MANOVA
showed significant differences among the groups, these further analyses
must be performed on a correlation matrix obtained from the pooled
within groups dispersion matrix. Use of this within groups partition
of variance will eliminate the differences between groups and give a
more accurate view of the inter-relationships among the individual
variables. This analysis is being performed.


C.  Conclusions  from  Objective  Analyses 

Therefore  based  on  the  above  analyses,  it  appears  that  the  flights 
that  merged  and  made  approaches  from  sequence  order  information  only  (i.e. 
via  TSD)  achieved  distinctly  different  and  better  individual  and  system 
performances  than  vectored  flights.  On  the  other  hand  and  perhaps  some¬ 
what  counterintuitively,  those  flights  that  were  vectored  in  a  mix  of 
vectored  and  nonvectored  ones  were  also  distinctly  different  from  flights 
in which all were vectored. This perhaps indicates that controllers
found it more difficult to vector A/C in a nonhomogeneous mix than when
all are vectored, even though the A/C "workload" judged by vectored A/C
numerosity was 1/3 lower in the mixed condition than in the all vector
condition.


2.  Verbal  Measures 

The verbal communications during each run were tape recorded and
later transcribed to hard copy for further analysis of the verbal workload
for both vectoring and nonvectoring flight simulations. As before, the
traffic situation display flights and those flights without traffic
situation displays are compared, and more specifically the flights with
TSD are compared to flights in the vectoring condition and to vectored
flights within the nonvectoring condition. (Vectored pilots flying in
the nonvectoring condition will be referred to as pilots without a TSD.)


A.  Word  Rates  and  Word  Counts 

Running cumulative word rates (averaged over successive 60 second
intervals) are shown for the total group, pilots and controllers in
Figures 5a, 5b and 5c, respectively.

In these figures, rates are plotted against the running cumulative
word count, permitting both to be compared. Comparisons must be made
at equal numbers of plot points (60 sec. intervals). One immediate
feature of the graphs is a high initial word rate which decreases
sharply, followed by a subsequent increase from which point the rate
continues to decrease for several minutes. While the exact behavior
is no doubt task specific, this same general behavior was observed in a
previous experiment.*
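
As a rough illustration of how such curves could be produced, the sketch below computes a running cumulative word count and a running cumulative word rate from per-interval word tallies. The definition used (total words so far divided by elapsed minutes) is an assumption, since the text does not spell out the computation, and the input tallies are invented purely for illustration.

```python
from itertools import accumulate


def running_word_rates(words_per_interval, interval_s=60):
    """Return (cumulative_counts, cumulative_rates_wpm), one entry per interval.

    cumulative_rates_wpm[k] is the total number of words spoken through
    interval k divided by the elapsed time in minutes. This definition is
    an assumption, not taken from the paper.
    """
    counts = list(accumulate(words_per_interval))
    rates = [c / ((k + 1) * interval_s / 60.0) for k, c in enumerate(counts)]
    return counts, rates


# Hypothetical word tallies for six successive 60-second intervals.
counts, rates = running_word_rates([90, 40, 60, 50, 30, 30])
for c, r in zip(counts, rates):
    print(f"cumulative words = {c:4d}   cumulative rate = {r:6.1f} words/min")
```

Plotting `rates` against `counts` gives one curve per condition, and two conditions are then compared at the same interval index, as the text requires.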

Figure 5a, which combines the total pilot and controller communication,
presents a fair comparison of the conditions. Vectoring and nonvectoring
conditions had the same total word count (300) although the initial rates
were lower for nonvectoring. However, when the two types of flights (TSD,
NTSD) within nonvectoring are compared, a large difference is apparent.
Both the initial word rate and final word count were about 75% greater for
the NTSD flights than for the TSD ones. This type of finding is completely
consistent with previous experiments. An interesting and somewhat
unexpected finding is that the word rate for the vectored flights in the
nonvectoring condition (i.e. nonTSD) was considerably higher even than for
flights in the vectoring condition. This phenomenon was observed for each
of the three groups individually, so it appears to be fairly robust.

When just the pilot communication is examined in Figure 5b, the
differences between these 4 conditions are not as marked, although again
the NTSD flights had a higher initial word rate and final word count
than any of the other 3 conditions. TSD (nonvectored) flights had
the lowest initial word rates.

Figure 5c shows that the effects of the four conditions were most
pronounced for the controllers' communication, with the same general
findings holding as before. The nonvectored flight (TSD) had the lowest
verbal workload in terms of initial word rate and total word
count, while the vectored flight in the nonvectoring condition (NTSD)
had the highest word rate and total word count.

The group consistency effect is shown visually in Figure 6 as a
means of suggesting the robustness of the previous findings. Figure 6
plots the normalized differences in total word counts for each comparison.
The controller groups were quite consistent across the comparisons
with somewhat less consistent results for the 5 pilot groups.

Figure 6  Pairwise Comparison of Total Word Counts in Each Flight
Type for the Three Groups.


B.  Content  Analyses 

Content analyses are being performed on the verbal data. The
results to date show that the pilots with TSD's spent significantly
less time discussing direction and heading than under either of the two
vectoring situations. And not too surprisingly, controllers communicating
with the nonvectored flights (TSD) made fewer references to speeds than
when communicating with the vectored flights.

C.  Conclusions  Based  on  Verbal  Analyses 

The verbal workload for controllers is considerably reduced
in both word rate and total words by taking advantage of the pilots'
TSD capability. Pilots with TSD have essentially the same word rate
and total word count as when they are in an "all vector" environment.

Thus the lower controller verbal workload in the sequencing condition
is not reflected in a higher verbal workload for the pilot. This finding
is essentially the same as in a previous experiment.*

The effect of a mixed traffic information environment, however, is
surprising. The verbal workloads for both controllers and pilots being
vectored in the mixed control environment were markedly higher than when
the environment was not so mixed. This may be caused by an increased
difficulty in vectoring A/C around in a space also being used in a
"VFR" mode by other pilots. It might also simply reflect the increased
time the controller has when some of the A/C do not need vectoring,
causing him to increase his communication with the remaining A/C.
Analysis to determine the pertinence of this increased communication
is being pursued.

In practice, this finding might suggest a necessity to segregate
TSD and NonTSD equipped A/C, handling each group separately on a "VFR",
"IFR" basis.

3.  Subjective  Measures 

A.  Questionnaire  Results 

Subjects  filled  out  evaluation  sheets  after  each  run  and  a 
final  questionnaire  at  the  conclusion  of  all  of  their  runs.  Figure  7 
summarizes  some  of  the  averaged  results  obtained.  Subjects  were 
asked  to  indicate  their  assessments  by  placing  tic-marks 
on  an  ungraduated  line.  Thus  placement  of  a  mark  indicates  not  only 
relative  ordering  but  strength  of  the  assessment  as  well. 


The overall preference for the two main conditions is shown by
System Evaluation on the final questionnaire. As in a previous
experiment,* controllers preferred vectoring to nonvectoring while
pilots preferred the nonvectoring (i.e. mixed) environment even though
they had experienced both TSD flights and nonTSD flights (in which
they were vectored). The strengths of the two choices were essentially
identical for pilots and controllers.

The  pilots  also  felt  that  the  nonvectoring  condition  was  safer 
than  vectoring  although  again  controllers  had  the  reverse  opinion. 
However,  a  more  detailed  analysis  obtained  after  each  run  shows  that 
controllers  felt  the  nonvectoring  condition  to  be  slightly  safer  than 
vectoring.  The  pilots  judged  the  TSD  (nonvectored)  flights  as  most  safe 
and  the  vectored  flights  in  the  nonvectoring  environment  (NT)  as  least 
safe, even compared to being vectored in an all vector environment.

Figure 7  Pilot and Controller Averaged Responses to Questions After
Each Run and After the Experiment. The Final Questionnaire
Compares the Two Basic Conditions. The Run Evaluation
Separates TSD (T) from NonTSD (NT) Flights in the NV Condition.

In fact, pilots rated the vectored flights in a mixed environment
(NT) lowest on the three primary FAA criteria of Safety, Orderliness
and Expeditiousness. The fact that NTSD received lowest marks on these
subjective evaluations, on the verbal measures (subjective and objective)
and on many of the objective measures is a very strong finding.

Pilots  felt  the  total  workload  to  be  essentially  the  same  regardless 
of  the  condition  (V,  NV)  or  the  type  of  flight  made  within  the  NV 
condition.  Controllers  quite  clearly  ranked  the  vectoring  condition 
as having a higher (worse) verbal and total workload whether assessed
after  each  run  or  in  retrospect  at  the  end  of  the  experiment.  Pilots 
and  controllers  also  clearly  assessed  the  vectoring  condition  as 
requiring  a  higher  verbal  workload. 

B.  Conclusions  from  Subjective  Measures 

A fairly consistent theme is that even though pilots preferred
the nonvectoring condition to vectoring on the whole, they least liked
being vectored in an environment where others were flying VFR. Pilots
quite consistently preferred flying VFR (even in the mixed condition)
to being vectored. Controllers acknowledged that the nonvectoring
condition as a whole had a lower workload and did not necessarily
express any large difference between the two conditions on the
three primary FAA criteria. However, in an encompassing evaluation,
controllers preferred to operate in the wholly vectoring environment.

The above remarks are quite in keeping with a previous experiment,
particularly as to pilot-controller system preferences. Pilots want
to fly VFR after a sequence order is established and controllers want
a strong ground centralized system regardless of the workload decrease
possible in a distributed management mode of control.


CONCLUSIONS 


The conclusions offered are based on the objective, verbal and
subjective analyses presented above. In considering the reality
of an environment in which some A/C have TSD while others don't, it
should be remembered that this situation was predicated on two
possibilities: the first could be the failure of an onboard TSD in a
homogeneous TSD environment, while the second follows the simple
realization that such a mixed environment could be quite natural based
on the cost of TSD equipment.

The  analyses  support  the  primary  conclusion  that  the  mixed  environment 
simulated  can  be  managed  effectively  on  a  distributed  management  basis 
in  which  those  A/C  with  TSD  are  given  initial  sequence  orderings  and 
then fly "VFR" while the A/C without TSD are vectored and speed controlled.

Since this mixed condition as a whole produced [...] good as the [...],
insertion of nonTSD equipped A/C [...] traffic has any real disruptive
effect. (Of course neither one of these possibilities is necessarily
advocated.)

The TSD equipped A/C operating [...] (controllers issue sequence [...])
generally [...] better results than when [...]: verbal [...] considerably
lower, intercrossing spacing [...] variability, etc., and this
mode of control had clear pilot acceptance.

However, the nonTSD A/C in the mixed environment required more verbal
communication than in an all-vector environment, generally achieved
lowest performance scores and were least well accepted by pilots, even
though from the pilots' [...] the all-vector condition [...] assurance
by [...] VFR-IFR environment without knowledge of where other traffic
is. [...] flying VFR.

In the context of the present experiment, spatially segregating
the TSD from the nonTSD A/C and using distributed management for TSD
[...] form the basis for a considerably better overall system than
either a mixed or all-vector one.

[...] results with distributed management for TSD in comparison to
vectoring is fully in keeping with previous experiments
at Ames Research Center.
SUMMARY

The purpose of this research-in-progress is to investigate the use which
pilots make of a moving map [...] enroute through the terminal area and
including the [...] flight phases. Various features of each of the
primary [...] the electronic moving map Multifunction Display (MFD),
Horizontal Situation Indicator (HSI), and Electronic Attitude Director
Indicator (EADI) [...] used in the three phases of flight mentioned above
when the STOLAND [...] operated in each of three ways: a) flown in the
fully automatic mode with the [...] in a monitoring role; b) flown
manually using [...] to reduce workload and task requirements in an
[...]; c) flown manually using raw instrument situation data.
Eye-point-of-regard and workload measurements, coupled with task
performance measurements, [...] ratings and pilot comments are presented
[...] to determine the pilots' use of the MFD in conjunction with the
other displays. The measurements, ratings, and comments provide an
indication of the [...] of the MFD as a supplement to the HSI for
improving flight safety in following [...] patterns.

INTRODUCTION

This research is designed to provide for a systematic comparison of an
electronic Multifunction Display [...] and an electromechanical Horizontal
Situation Indicator (HSI) [...] other instruments (EADI, altimeter,
airspeed [...]) [...] NASA/Ames Research Center STOLAND digital [...] and
control of powered-lift short-haul aircraft. This research forms one part
of the joint DOT/NASA STOL Operating Systems Experiments Program.


* [...] on work done for the Aircraft Guidance and Navigation
Branch, Flight Systems Research Division, Ames Research Center, under
Contract NAS2-8973. [...]
The objective of the joint DOT/NASA STOL Operating Systems Experiments
Program is to provide data to aid the design of terminal area guidance,
navigation and control systems and the definition of operational
procedures for powered-lift and light wing loading short-haul aircraft
under IFR. As a first step in this program, experimental digital automatic
and flight director guidance and control systems have been developed for
the NASA Augmentor Wing powered-lift short-haul aircraft by Sperry Flight
Systems (under NASA contract). This system, called STOLAND, is based on
the application of current CTOL system techniques and displays to the
experimental short-haul aircraft (Ref. 1).

Two of the primary displays used in the system are an Electronic Attitude
Director Indicator (EADI) and a standard Horizontal Situation Indicator
(HSI). In addition, this digital system has a computer driven, cathode ray
display called the Multifunction Display, or MFD, which displays the
aircraft position and predicted motion on a moving map of the area. Also
displayed are other status data including heading, altitude, raw navaid
data and reference flight paths. Annotated illustrations of each of the
primary displays are shown in Figs. 1-3, and a view of the pilot's
instrument panel in the STOLAND simulator used for this investigation is
shown in Fig. 4.

This paper presents a preliminary review of some of the comparative
measurements and pilot opinions from a flight simulation in February and
March 1976. The experimental program for comparing the MFD and HSI within
the context of the whole cockpit will be summarized first. Then, after
presenting some of the key results, we shall recapitulate our tentative
findings in the concluding section of this paper.


EXPERIMENTAL PROGRAM


The experimental investigation was conducted on the NASA/Ames fixed-base
STOLAND simulation facility. This facility includes: a) a fully
instrumented cockpit; b) a six-degree-of-freedom C-8A Augmentor Wing Jet
STOL Research Aircraft/environment/navigation simulation program
implemented on an EAI 8400 digital computer; and c) a complete STOLAND
digital avionics system.


If the display content has been suited to the task, the display format and
symbology will usually be crucial only if the pilot is at a saturated level
of workload in a realistic flight simulation or in actual flight.
Consequently, we attempted to emphasize the realistic air route navigation,
guidance and control environment for short-haul aircraft in preparing the
experimental design summarized in Table 1. Three classes of independent
variables are shown in the table. The level of pilot involvement in
guidance and control tasks is divided between two independent classes, one
of which we have called "technique," i.e., either manual or automatic, and
the other of which we have called "the level of display," i.e., either
situation (raw data only) or flight director and situation on the EADI,
with the HSI and MFD the obvious independent display variables for
comparison.

The flight phases of interest in this experiment were fourfold: a) enroute
within 56 km (30 nm) of Crows Landing ALF, Colusa County, California; b) the

Figure 3. Multifunction Display (MFD)

Figure 4. STOLAND Simulator Instrument Panel
TABLE 1. SUMMARY OF EXPERIMENTAL DESIGN

                                          LEVEL OF DISPLAY
                                  SITUATION           FLIGHT DIRECTOR AND
PILOTING    NAVIGATION AND        (RAW DATA)          SITUATION ON EADI
TECHNIQUE   CONTROL WORKLOAD      HSI       MFD       HSI    MFD    BOTH HSI
                                                                    AND MFD

Manual      Tracking a STAR       √ XSCC*   √ XSCC*   √      √      √ EPR†
            sequence              (Highest
                                  workload)

            Select different      √         √         √      √      √ EPR†
            radio navaids
            enroute for STOLAND
            and maintain
            geographic
            orientation

Automatic   Tracking a STAR       (Lowest workload)
            sequence

10 cells x 2 replications x 5 pilots = 100 runs

*XSCC = Measurement of excess control capacity with
        cross-coupled secondary control task
†EPR  = Measurement of eye-point-of-regard

Dependent Variables (i.e., Measurements)

a. Flight plan performance errors:
   • Airspeed error with respect to commanded flight profile
   • Lateral distance error with respect to commanded course
   • Altitude or glide slope displacement error
   • Elapsed time between waypoints in flight plan

b. Other aircraft motion and control variables (e.g., pitch and roll
   attitudes, pitch and roll rates, heading, turn rate, airspeed,
   inertial velocity, angles of attack and sideslip, course and path
   angles [or ground and vertical velocities], translational
   accelerations)

c. Eye-point-of-regard in azimuth and elevation

d. Subjective display ratings (e.g., controllability-and-precision,
   status utility, clutter, attentional demand)

e. Excess control capacity

f. Caution advisory response latency (from a light-cancelling task
   designed to measure the pilot's simple reaction time)
terminal area itself; c) the landing approach; and d) the go-around
involving a holding pattern after a missed approach. We included the four
flight phases within a class of independent variables representing the
level of pilot involvement in air navigation tasks, i.e., whether the pilot
is just following a standard terminal arrival route (STAR) assigned
initially by a controller or whether the pilot is additionally involved in
reselecting radio navaids enroute for STOLAND, both during arrival and
after executing a missed approach.

While using the automatic mode of STOLAND during initial training sessions,
we discovered that automatic operation is so devoid of pilot workload that a
critical comparison of the HSI and MFD cannot be made, because the pilot is
not even saturated with monitoring tasks. Therefore, we refocused the
experimental design on only the manual piloting technique and deferred
investigation of the automatic technique.

We  have  indicated  in  Table  1  the  cells  in  the  experimental  design  which 
were  most  relevant  by  check  marks.  We  have  also  indicated  the  cells  which 
involved  the  highest  and  lowest  workloads  and  the  two  cells  which  were  most 
amenable  to  eye-point-of-regard  comparison. 

Pilot workload is high to begin with when flying the C-8A Augmentor Wing
manually with combinations of powered and aerodynamic lift. Since the several
STAR's involved holding patterns and curved paths as well as straight segments,
reliance  on  the  HSI  (and  EADI)  without  the  MFD  placed  the  highest  workload 
demand  on  the  pilot,  because  he  had  to  keep  track  of  his  position  mentally  with 
the  aid  of  his  enroute  and  terminal  area  charts  as  he  progressed  along  the 
assigned  STAR,  reselected  radio  navaids,  and  executed  missed  approaches,  go- 
arounds  and  holding  patterns. 

Since  the  pilot  will  scan  to  and  fixate  on  instruments  which  display  redun¬ 
dant  information,  there  is  a  danger  in  presenting  both  the  HSI  and  MFD  when  the 
pilot  is  required  to  fly  with  only  raw  situation  data.  Having  both  horizontal 
displays  may  actually  increase  his  scanning  workload  unnaturally  when  he  is 
already  saturated  or  oversaturated.  Therefore,  we  covered  the  horizontal  dis¬ 
play  which  was  not  being  evaluated  in  eight  cells  of  Table  1,  because  the  pilot 
will  scan  to  instruments  which  display  no  information  or  which  are  temporarily 
inactive,  if  given  the  opportunity.  However,  both  the  HSI  and  MFD  were  uncov¬ 
ered  and  presented  to  the  pilot  simultaneously  in  the  two  cells  in  the  extreme 
right  column  of  Table  1  when  the  pilot  was  using  the  flight  director  and  situa¬ 
tion  on  the  EADI  to  fly  manually.  We  expected  that  any  outstanding  bias  in 
the  partitioning  of  the  eye-point-of-regard  distribution  between  the  HSI  and 
MFD  might  afford  a  measure  of  pilot  preference  for  or  confidence  in  monitoring 
the  horizontal  situation. 
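
To make the idea of a bias in the eye-point-of-regard distribution concrete, the sketch below tallies how a sequence of sampled fixations divides among instruments and what share of the horizontal-display samples falls on the MFD. The instrument labels, the equal-interval sampling, and the sample data are all assumptions made for illustration; they are not the measurement procedure used in the experiment.

```python
from collections import Counter


def dwell_fractions(fixations):
    """Fraction of eye-point-of-regard samples falling on each instrument."""
    tally = Counter(fixations)
    total = sum(tally.values())
    return {name: n / total for name, n in tally.items()}


def mfd_share_of_horizontal(fixations):
    """Share of HSI-plus-MFD samples that fall on the MFD (0.5 means no bias)."""
    tally = Counter(fixations)
    horizontal = tally["HSI"] + tally["MFD"]
    return tally["MFD"] / horizontal if horizontal else None


# Hypothetical equally spaced samples of where the pilot was looking.
samples = ["EADI", "MFD", "EADI", "HSI", "MFD", "EADI", "MFD", "OTHER"]
fractions = dwell_fractions(samples)
bias = mfd_share_of_horizontal(samples)
print(fractions, bias)
```

A share well above or below 0.5 in the two cells where both displays were uncovered would be the kind of outstanding bias the text has in mind.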


The displacement "window" on the EADI was deleted when runs were made to
test the HSI alone, since the HSI presents lateral and vertical deviation
anyway. The displacement scaling of one "half-window" on the EADI was
consistent with the displacement scaling of one dot on the HSI, viz.,
381 m (1250 ft) laterally and 30.48 m (100 ft) vertically.


In this connection, we should emphasize that the content of the MFD and
HSI are not strictly equivalent, because no waypoint numbers appear on the
HSI and no heading scale appears on the MFD, if the pilot elects the
north-up [...] purely numerical and [...] more unsuitable for tracking than
even the counter-[...] on the glide [...] to use the displacement window on
the EADI. [...] the tracking control aspects of this experiment [...]
plate] with the MFD, which presents a horizontal moving map of the [...]
and approach chart.

Heading was provided on the EADI's programmable display during the
experiment. We recommended to each pilot that the MFD be used in the
course- or heading-up orientation for consistency with the HSI and because
the heading scale on the MFD appears only when the course- or heading-up
orientation is selected. However, the RMI was always available to present a
compass rose when the HSI was covered in the event that a pilot elected to
keep the MFD north-up because he experienced disorientation with the
revolving map display in turns. The choice of map scale on the MFD was left
to the pilot; however, he was instructed that the STAR waypoint numbers
would appear only if the [...] 0.5 nm/in. scales were selected.

A steady wind speed of 20 kt from the east or west, as required by the
[...] to produce a prevailing tailwind enroute, was used throughout the
experiment to increase workload, and the wind velocity dispersion was
between 3 and 4 ft/sec.

Also listed in Table 1 is the minimum number of 100 runs required for two
replications of 10 cells counterbalanced for order effects with 5 pilots.
(Over 160 runs were made.) Pilots 1 and 6 are research pilots; Pilots 3
and 4 are commercial airline pilots; Pilot 5 is a general aviation
instructor and engineering pilot; and Pilot 2 remained on reserve for this
experiment and did not have to participate.
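
The run count can be checked by enumerating the manual-technique cells of Table 1. The sketch below is only a bookkeeping aid under the assumption that the ten cells are the two navigation workload levels crossed with the five display arrangements; the label strings are invented for this example.

```python
from itertools import product

# Two navigation/control workload levels (rows of Table 1, manual technique).
workloads = ["track STAR", "reselect navaids"]

# Five display arrangements (columns of Table 1): (level of display, display).
displays = [
    ("raw data", "HSI"),
    ("raw data", "MFD"),
    ("flight director", "HSI"),
    ("flight director", "MFD"),
    ("flight director", "HSI+MFD"),
]

cells = list(product(workloads, displays))
replications = 2
n_pilots = 5

minimum_runs = len(cells) * replications * n_pilots

# Cells in which one horizontal display was covered (only one is evaluated).
single_display_cells = [c for c in cells if c[1][1] != "HSI+MFD"]

print(len(cells), minimum_runs, len(single_display_cells))
```

The same enumeration reproduces the eight cells with one horizontal display covered and the two cells with both displays presented.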

Below the table of independent variables and cells there appears a list
summarizing the dependent variables, that is, the measurements which we
made. All measurements are self-evident except perhaps "excess control
capacity," which [...] measure of the spiral divergence required to load
the pilot to the point of saturation with control tasks while satisfying
[...] primary task performance [...] error criterion. Excess control
capacity is measured by increasing the spiral divergence until a stationary
value is reached by the cross-coupled adaptive regulator of the divergence
in balance [...] performance error criterion. [...] spiral divergence [...]
which represents his excess control capacity [...] task (Ref. 2). To the
pilot flying the aircraft, [...] spiral divergence seems like a malfunction
in lateral stability augmentation, so the measurement can be made while the
flight simulation retains high face validity.
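
The adaptive principle described above can be illustrated with a toy loop that raises the divergence while the primary-task error is below its criterion and lowers it otherwise, until a stationary value is reached. This is only a sketch: the update law, the gain, and the linear stand-in for the pilot's error are assumptions made for illustration and are not the cross-coupled regulator of Ref. 2.

```python
def adapt_divergence(error_at, criterion, gain=0.2, steps=500, start=0.0):
    """Adjust the divergence until the measured error matches the criterion.

    error_at(lam) stands in for the pilot-in-the-loop measurement of
    primary-task error at divergence lam (hypothetical, not from the paper).
    """
    lam = start
    for _ in range(steps):
        # Error below criterion: spare capacity remains, so load the pilot more.
        # Error above criterion: the pilot is overloaded, so back off.
        lam = max(0.0, lam + gain * (criterion - error_at(lam)))
    return lam


def toy_pilot_error(lam, baseline=0.2, slope=0.5):
    """Invented pilot model: error grows linearly with the imposed divergence."""
    return baseline + slope * lam


stationary = adapt_divergence(toy_pilot_error, criterion=1.0)
print(f"stationary divergence = {stationary:.3f}")
```

In this toy model the stationary divergence is (criterion - baseline) / slope, so a pilot with a lower baseline error, i.e. more spare capacity, settles at a larger divergence, which is the sense in which the stationary value measures excess control capacity.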
Four [...] pilot rating scales for use in research on and evaluation of
manual control displays were derived and used in the pilot experiments
reported in Ref. 3 and were well-suited to the present investigation. The
scales shown later in Table 6 are of interval-scale quality and will permit
averaging and other standard parametric statistical analyses. The use of
four trait categories (task controllability and precision, status utility,
clutter, and attentional demand) helped to separate subjective
identification of these often confounded effects. Rating forms for the HSI
and MFD were filled out by each pilot in the cockpit at the conclusion of a
simulated flight.

RESULTS  OF  THE  EXPERIMENT 


Although we shall continue to label the displays being compared as "HSI"
and "MFD" for conciseness in presenting the results where one or the other
horizontal display was uncovered, the reader should clearly understand that
"HSI" means "HSI, EADI (without the displacement window) and other
instruments" and that "MFD" means "EADI, MFD and other instruments." By
design, the HSI and MFD are being compared within the context of the whole
STOLAND display and control arrangement in the simulator cockpit.

We shall now turn to present the several forms of comparative results of
the experiment under the following subordinate topical headings:

- Blunders
- Tracking errors
- Excess control capacity
- Excess monitoring capacity
- Pilot opinion ratings
- Eye-point-of-regard

Blunders 

About 160 simulated flights, each lasting from 10 to 25 minutes,
were conducted among four standard terminal arrival routes (STAR's). The
most dramatic results are the 20 "blunders" partitioned in Tables 2 and 3.
Table 2 partitions the 9 blunders which occurred in the first phase of the
experiment in February while the pilots were primarily tracking STAR's as
reference flight paths. Table 3 partitions the remaining 11 blunders which
occurred during terminal area and enroute flight with emphasis on
geographic orientation (as well as tracking) in the second phase of the
experiment in March involving missed approaches, go-arounds and holding
patterns with three different radio navaids. The types of blunders
identified include loss of geographic orientation, loss of altitude
awareness, and loss of roll attitude control as well as several others.

Five blunders involved the HSI and 4 the MFD while tracking reference
flight paths exclusively (Table 2). However, 8 blunders involved the HSI,
2 the MFD, and 1 both displays during terminal area and enroute flight with
emphasis on geographic orientation (Table 3). The flight director was (or
should have been) in use during 11 of the 20 runs wherein blunders
occurred. Since 7 of these 11 blunders were also associated with the HSI in
Table 3, the combination of using the HSI for orientation with the flight
director for tracking while selecting different radio navaids for guidance
seemed to conspire to produce the most blunders. There were no blunders
involving the MFD and flight director in Table 3 and only three in Table 2.
Therefore, we would conclude
TABLE 2

DISTRIBUTION OF BLUNDERS WHILE TRACKING REFERENCE FLIGHT PATHS

                                        RAW DATA
BLUNDERS                                HSI            MFD

Loss of geographic orientation          2              None
Loss of altitude awareness              1              None
Loss of roll attitude control           1              1
Other crashes                           1 (GPIP^a)     None

FLIGHT DIRECTOR  [...]

^a Glide path intercept point

TABLE 3

DISTRIBUTION OF BLUNDERS DURING TERMINAL AREA AND ENROUTE FLIGHT
WITH EMPHASIS ON GEOGRAPHIC ORIENTATION

BLUNDERS     RAW DATA     FLIGHT DIRECTOR     BOTH HSI AND MFD WITH FD

[...]
X 
on the basis of the blunder distribution alone from the simulation that the
MFD seems to offer a worthwhile improvement in safety, since 13 of 20
blunders involved runs wherein the MFD was not available to the pilot.
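
The blunder counts quoted in the text can be cross-checked with a few lines of arithmetic. The sketch below uses only the per-display totals stated above for Tables 2 and 3; the dictionary layout is simply a convenient way to hold them.

```python
# Blunders by display arrangement, as stated in the text.
table2 = {"HSI": 5, "MFD": 4}             # tracking reference flight paths
table3 = {"HSI": 8, "MFD": 2, "both": 1}  # terminal area and enroute flight

total_table2 = sum(table2.values())
total_table3 = sum(table3.values())
total = total_table2 + total_table3

# Runs in which the MFD was not available are the HSI-only runs.
without_mfd = table2["HSI"] + table3["HSI"]
share_without_mfd = without_mfd / total

print(total_table2, total_table3, total, without_mfd, share_without_mfd)
```

The totals agree with the 9, 11 and 20 blunders reported, and the 13 HSI-only blunders amount to 65 percent of the total.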


Tracking  Errors 

The least dramatic results are to be found among the flight plan tracking
errors in three dimensions (lateral and vertical displacement and airspeed)
and the related variations in aircraft motions. As one would expect, the
flight director provided for more precise tracking of the assigned altitude
and the glide slope than otherwise. However, there was no consistent
evidence of differences between tracking errors with the HSI versus the MFD
even with only raw data. Yet, as we mentioned in beginning the discussion
of results, the "MFD" implies the use of the integrated EADI as the
tracking display, and occasionally better altitude-keeping performance
appeared with the "MFD" than with the "HSI."


Excess  Control  Capacity 

The  measurement  of  excess  control  capacity  was  provided  by  the  average 
cross-coupled adaptive spiral divergence in selected runs with either the HSI
or the MFD. The null hypothesis of equality between mean values of excess control
capacity within comparable pairs of waypoint groups with either display
arrangement was tested for significant differences. The results of these tests
are listed in Table 4 by pilot and flight plan. The column heading "neither"

TABLE 4

NUMBER OF COMPARABLE PAIRS OF WAYPOINT GROUPS FOR WHICH ONE OR
THE OTHER DISPLAY ARRANGEMENT EXHIBITED SIGNIFICANTLY GREATER
AVERAGE EXCESS CONTROL CAPACITY AT THE 0.05 LEVEL(a)


PILOT 

FLIGHT PLAN

HSI/EADI 

NEITHER 

EADI /MFD 

i 

2 

1 

k 

1 

3 

2 

4 

1 

3 

3 

1 

1 

L 

k 

2 

5 

1 

6 

5 

2 

J_ 

k 

Totals 

11 

11 

16 

(a) The null hypothesis is "neither." The probability of rejecting
the null hypothesis when it is true is 0.05. Behrens', Scheffe's,
and Tukey's tests produced consistent results.
results under the ergodic hypothesis, because the number of samples available
within  each  waypoint  group  was  on  the  order  of  several  hundred  or  more. 

The  column  headings  "HSI"  or  "MFD"  identify  the  numbers  of  comparable  pairs 
of  waypoint  groups  for  which  the  null  hypothesis  was  rejected,  i.e.,  for  which 
one  or  the  other  display  arrangement  exhibited  significantly  greater  average 
excess control capacity at the 0.05 level. The totals show that the null hypothesis
was rejected for 27 of 58 pairs at the 0.05 level. Of these 27 pairs,
the "MFD" exhibited greater average excess control capacity for 16, and the
"HSI" greater for 11 pairs. In the individual case of Pilot 5 tracking Flight
Plan  2  involving  only  a  curved  approach,  the  partition  is  in  favor  of  the  "HSI," 
a  result  which  was  consistent  with  that  pilot's  own  appraisal  of  that  flight 
plan.  However,  the  partition  for  Pilot  3  with  Flight  Plan  3,  involving  a  missed 
approach  and  holding  pattern  is  in  favor  of  the  MFD. 
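
The pairwise comparison tallied in this section can be illustrated with a minimal sketch. It assumes a large-sample, two-sided test on the difference of two means with unequal variances, which is reasonable only because each waypoint group held several hundred samples. It is not the authors' procedure (they report Behrens', Scheffe's, and Tukey's tests); the function name `classify_pair` and the sample data are hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean, variance


def classify_pair(hsi, mfd, alpha=0.05):
    """Return 'HSI', 'MFD', or 'neither' for one comparable pair of groups.

    hsi, mfd: samples of excess control capacity with each display.
    Assumption: samples are numerous enough for a normal approximation.
    """
    # Standard error of the difference of means, unequal variances allowed.
    se = sqrt(variance(hsi) / len(hsi) + variance(mfd) / len(mfd))
    z = (mean(mfd) - mean(hsi)) / se
    # Two-sided critical value at significance level alpha.
    z_crit = NormalDist().inv_cdf(1 - alpha / 2)
    if abs(z) <= z_crit:
        return "neither"  # null hypothesis of equal means is retained
    return "MFD" if z > 0 else "HSI"


# Hypothetical data: 300 samples per group, second group shifted upward.
base = [1.0 + 0.1 * ((i % 7) - 3) for i in range(300)]
shifted = [x + 0.5 for x in base]
print(classify_pair(base, shifted))
print(classify_pair(base, base))
```

Counting the three outcomes over all comparable pairs gives one row of a table such as the one above.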


Excess  Monitoring  Capacity 

The measurement of excess monitoring capacity was inversely proportional to
the average caution advisory response time. The null hypothesis of equality
between mean response times within comparable pairs of runs with either display
arrangement was tested for significant differences after a correction for the
skewness  of  the  response  time  distribution  was  made.  The  results  of  these 
tests  are  listed  in  Table  5  by  pilot.  The  column  heading  "neither"  identifies 

TABLE 5

NUMBER OF COMPARABLE PAIRS OF RUNS FOR WHICH ONE OR THE OTHER
DISPLAY ARRANGEMENT EXHIBITED SIGNIFICANTLY GREATER AVERAGE
EXCESS MONITORING CAPACITY AT THE 0.05 LEVEL(a)

PILOT     HSI/EADI   NEITHER   EADI/MFD
1            1          4          1
3            0          7          2
4            0          9          2
5            1          4          1
6            1          1          4
Totals       3         25         10

(a) The null hypothesis is "neither." The probability of
rejecting the null hypothesis when it is true is 0.05.
Behrens', Scheffe's, and Tukey's tests produced consistent
results.
the number of comparable pairs of runs for which the null hypothesis was
accepted. The probability of rejecting the null hypothesis when it is true is
0.05. Again Behrens', Scheffe's, and Tukey's tests produced consistent results,
because there were usually at least eleven samples in the ensemble for each run.
The column headings "HSI" or "MFD" identify the numbers of comparable pairs of
runs for which the null hypothesis was rejected, i.e., for which one or the
other display arrangement exhibited significantly greater average excess monitoring
capacity at the 0.05 level. The totals show that the null hypothesis
was rejected for 13 of 38 pairs at the 0.05 level. Of these 13 pairs, the
"MFD" exhibited greater average excess monitoring capacity for 10, and the "HSI"
greater for 3 pairs.
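
The tallies quoted above can be checked with a few lines of arithmetic. The sketch below assumes per-pilot counts (HSI greater, neither, MFD greater) as read from Table 5; the table is partly illegible in the source, so the Pilot 6 row is inferred from the column totals and should be treated as an assumption rather than as reported data.

```python
# Per-pilot counts of comparable run pairs: (HSI greater, neither, MFD greater).
# Assumption: values transcribed from Table 5; the Pilot 6 row is inferred
# from the column totals because the source is partly illegible.
table5 = {
    1: (1, 4, 1),
    3: (0, 7, 2),
    4: (0, 9, 2),
    5: (1, 4, 1),
    6: (1, 1, 4),
}

# Sum each column across pilots.
hsi_total, neither_total, mfd_total = (sum(col) for col in zip(*table5.values()))
rejected = hsi_total + mfd_total      # pairs where the null hypothesis was rejected
pairs = rejected + neither_total      # all comparable pairs

print(f"rejected {rejected} of {pairs}: MFD {mfd_total}, HSI {hsi_total}")
```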


Pilot  Opinion  Ratings 

Tables 6a and 6b present summaries of the subjective opinion ratings of
the HSI and MFD by each of Pilots 1, 3, 4, and 6 during the second phase of the
experiment in March emphasizing geographic orientation as well as tracking. A
comparison of Tables 6a and 6b shows a slightly less favorable central tendency
in the ratings of the task controllability and precision when using the HSI,
whereas ratings are more uniformly distributed over four descriptive phrases
when using the MFD. Ratings of task controllability and precision with the
flight director in use are uniformly distributed over four descriptive phrases
when using either the HSI or MFD. Comparison of the ratings for utility of
status information between the HSI and MFD shows more favorable ratings for the
MFD and a markedly unfavorably skewed distribution of ratings for the HSI which
exhibits a mode beside the descriptive phrase: (S4) "inadequate number of
states...." Comparison of the ratings for clutter shows few differences in the
tendency of both groups of ratings to centralize beside the descriptive phrase:
(K3) "some clutter." Only one rating of the MFD was more unfavorable than K3.
Comparison of the ratings for display attentional workload shows a more favorable
central tendency beside the descriptive phrase: (D3) "mildly demanding"
for the MFD, whereas the distribution of ratings for the HSI is unfavorably
skewed with a mode beside the descriptive phrase: (D4) "quite demanding."

During  the  tracking  of  reference  flight  paths  in  the  first  phase  of  the 
experiment  in  February,  slightly  different  central  tendencies  in  some  of  the 
ratings  by  Pilots  3,  4,  and  5  were  observed.  For  example,  the  ratings  for 
utility  of  status  information  on  the  HSI  were  bimodally  distributed  between 
(S2)  "many  of  desired  states  presented"  and  (S4)  "inadequate  number  of  states." 
The rationale was apparent from the accompanying commentary, viz., that the HSI
is  quite  adequate  for  tracking  rectilinear  courses  whether  inclined  or  not, 
whereas  the  HSI  is  deficient  for  tracking  curved  courses  in  the  presence  of 
wind.  Ratings  of  the  utility  of  status  information  on  the  MFD  were  skewed 
favorably  with  a  mode  beside  S2.  A  few  more  "quite  cluttered"  (K4)  ratings 
of  the  MFD  were  received,  although  the  central  tendency  on  both  HSI  and  MFD 
remained  beside  K3.  Ratings  of  attentional  workload  while  tracking  with 
either HSI or MFD were centered on (D4) "quite demanding," although relatively
more (D5) "completely demanding" ratings of the HSI were given.
Eye-Point-of-Regard (EPR)

This  experiment  has  resulted  in  the  acquisition  of  our  largest  archive  of 
high quality data to date with the STI Eye-Point-of-Regard System Model EPR-2.
However, it will be possible to review only a small sample of the EPR data
here. The EPR data acquisition was confined to runs wherein both HSI and MFD
(as well as all other active displays and controls in the cockpit) were available
to the pilot in accord with the experimental plan in Table 1. The reduced
data to be presented are from the go-around phase of four runs by two pilots.
Figures 5-8 show EPR dwell fractions, look fractions, and transition link fractions
for the pilots using either (a) only raw situation data or (b) the flight
director with situation data during the go-around phase to a holding fix following
a missed approach.

The dwell fraction is merely the relative dwell time-weighted look fraction,
and the look fraction is simply the relative number of fixations on each instrument
or display. The look fraction represents the ensemble probability of fixation
and the dwell fraction, the temporal probability of fixation. The bi-directional
link fraction is the relative number of scan transitions in both
directions between each pair of instruments or displays. In rare cases, transitions
occurred in only one direction noted by an arrow. The sums of each
type of fraction may not equal exactly unity, because of round-off errors in
the listed values.
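
The three fractions defined above can be computed from an ordered record of fixations. The following is a minimal sketch, assuming each fixation is logged as an (instrument, dwell time in seconds) pair; the function name `epr_fractions` and the sample scan are hypothetical and are not taken from the EPR-2 data reduction software.

```python
from collections import Counter


def epr_fractions(fixations):
    """Compute look, dwell, and bi-directional link fractions.

    fixations: ordered list of (instrument, dwell_seconds) tuples.
    """
    looks = Counter(name for name, _ in fixations)
    dwell = Counter()
    for name, seconds in fixations:
        dwell[name] += seconds

    n_looks = sum(looks.values())
    total_dwell = sum(dwell.values())
    # Look fraction: relative number of fixations on each instrument.
    look_fraction = {k: v / n_looks for k, v in looks.items()}
    # Dwell fraction: relative time spent fixating each instrument.
    dwell_fraction = {k: v / total_dwell for k, v in dwell.items()}

    # Link fraction: relative number of scan transitions between each
    # pair of instruments, counted in both directions together.
    links = Counter()
    for (a, _), (b, _) in zip(fixations, fixations[1:]):
        if a != b:
            links[frozenset((a, b))] += 1
    n_links = sum(links.values())
    link_fraction = {tuple(sorted(k)): v / n_links for k, v in links.items()}
    return look_fraction, dwell_fraction, link_fraction


# Hypothetical scan record.
scan = [("EADI", 1.0), ("MFD", 0.5), ("EADI", 1.5), ("HSI", 0.5), ("EADI", 0.5)]
look, dwell_frac, link = epr_fractions(scan)
print(look, dwell_frac, link)
```

A one-directional link, as noted by arrows in the figures, would require counting ordered pairs instead of the unordered pairs used here.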

Comparison of Figs. 5 and 6 shows that Pilot 3 used the HSI more when the
flight director was off and the MFD more when the flight director was on. This
is  exemplified  by  all  three  types  of  fractions.  Pilot  3  was  consistent  in  this 
dichotomy.  There  are  probably  two  underlying  reasons: 

a.  When  using  raw  situation  data,  it  may  be  easier  to  close  the 
heading control loop with the more familiar HSI, and

b. When using the flight director on the EADI, it may be easier
to monitor the aircraft's heading and geographic position
simultaneously using the moving map display on the MFD.

Direct  crosschecks  between  the  HSI  and  the  MFD,  although  rare,  do  appear 
in  one  direction  from  the  HSI  to  the  MFD  in  the  results  in  Figs.  5  and  7  with 
only  raw  data. 

Pilot 1 (Figs. 7 and 8) used the HSI hardly at all during the go-around
phase to a holding fix with raw data and not at all during the go-around with
the flight director. The absence of any transitions to the altimeter in Fig. 8
may  be  because  barometric  altitude  was  available  on  the  MFD,  although  we  have 
not  yet  analyzed  subsidiary  transitions  within  the  face  of  the  MFD. 

Pilot 1, a research pilot, was much more familiar with the EADI and MFD
than Pilot 3, a commercial airline pilot. The familiarity and confidence of
Pilot 1 in using the MFD is evident in comparing his eye-point-of-regard distribution
with that for Pilot 3, in spite of the extensive training time provided
for the commercial pilots who participated in the experiment.
[Figures 5-8 legend: bi-directional link fraction, except as noted by arrows]
Although we must conclude this brief preview of results, we shall, in the
summary which follows, provide a concise overview of some of the other trends
in the eye-point-of-regard distributions.

SUMMARY OF RESULTS AND CONCLUDING REMARKS

On the basis of the blunder distribution alone from the simulation, the
MFD seems to offer a worthwhile improvement in safety, since 15 of 20 blunders
among 160 runs involved runs wherein the MFD was not available to the pilot,
four involved runs with the MFD, but not the HSI, and only one involved a run
with both.

The flight director provides for more precise tracking of the assigned
altitude and the glide slope than otherwise. However, there is no consistent
evidence of differences between tracking errors with the HSI versus the MFD
among  the  five  practiced  pilots  who  participated  in  the  simulation,  although 
there  are  instances  where  altitude-keeping  was  more  precise  with  the  MFD  when 
using  only  situation  data. 

The null hypothesis of equality between average excess control capacity
within comparable pairs of flight plan waypoint groups using either the HSI or
the MFD arrangement was tested for significant differences. The results show
that the null hypothesis was rejected for 27 of 58 pairs of comparable waypoint
groups at the 0.05 level, where 0.05 is the probability of rejecting the null
hypothesis when it is true. Of these 27 rejected pairs, the MFD exhibited
greater average excess control capacity for 16, and the HSI greater for 11 pairs.


The null hypothesis of equality between average excess monitoring capacity
within comparable pairs of runs using either the HSI or the MFD arrangement was
tested for significant differences. The results show that the null hypothesis
was rejected for 13 of 38 pairs of comparable runs at the 0.05 level. Of these
13 rejected pairs, the MFD exhibited greater average excess monitoring capacity
for 10, and the HSI greater for 3 pairs.

The pilots provided subjective ratings of (a) task controllability and
precision, (b) utility of status information, (c) symbol-background clutter on
the display, and (d) display attentional workload, each on five-point descriptive
scales. Summarized comparisons of all ratings for the MFD and the HSI in
each category follow.

Task Controllability and Precision. There is a slightly less favorable
central tendency to rate the HSI "controllable, with inadequate
precision," in tracking curved paths in the presence of wind, whereas
the ratings favor the MFD as "controllable, with fair precision."

Ratings with the flight director in use are uniformly distributed
over four descriptive phrases from "easily" to "marginally" controllable
and exhibit no central tendency with either HSI or MFD.

Utility of Status Information. The MFD received more favorable ratings
than the HSI. Ratings of the usefulness of information supplied covered
[...] for maintaining geographic orientation while tracking [...] the
presence of wind.

Ratings of both the HSI and MFD as having "some clutter" predominate;
there is a slight tendency to rate the MFD less favorably.

Attentional Workload. Ratings of the MFD show a central tendency
between "mildly" and "quite demanding," whereas ratings of the
HSI exhibit a less favorably skewed mode between "quite" and
"completely demanding."

[...] single flight plan, the EPR data from two runs also suggest a preference by
Pilot 4 for the MFD in curved path tracking. In the [...] data suggest an
equitable distribution [...] for the [...] horizontal displays are scanned, in
turn, from the EADI [...] Flight Plan 5, [...] where the HSI receives relatively
more looks. Direct crosschecking between the HSI and the MFD is rare, but such
scan transitions are [...] from HSI to MFD by Pilots 1, 3, and 4 when using raw
situation data.

All of the pilots provided a great number of helpful supporting comments
in [...] experiment. There was a [...] comments which acknowledged the excellence
of the HSI for tracking [...] inclined courses, yet recognized the superiority
[...] confidence in geographical orientation while tracking curved [...]
and establishing holding patterns in the presence of wind [...] bearings, DME,
and deviations would seem to be an [...] for [...] I would...want both HSI and
MFD, if the MFD could be shared with the weather radar display....The HSI [...]
my confidence [...] map on the MFD."
AIR-TO-GROUND VISUAL DISPLAY SYSTEM


Daniel  L.  Kugel 

Randall  V.  Gressang,  Capt,  USAF 
Air  Force  Flight  Dynamics  Laboratory 
Wright-Patterson  AFB,  Ohio 


ABSTRACT 


The ability to simulate realistic air-to-ground tactical missions is
very important in the evaluation of prototype fighter aircraft, testing of
new Air Force weapon systems, and the training of pilots for high performance
aircraft. Most aerospace simulators, however, do not have this capability. The
Air Force Flight Dynamics Laboratory at Wright-Patterson AFB, Ohio has
recently developed, in a joint effort with the Northrop Corporation, a Large
Amplitude Multi-mode Aerospace Research Simulator (LAMARS). The use of this
simulator for air-to-ground tactical mission simulation is discussed. The
utilization of a state-of-the-art helmet sight system and its role in the
development of a head slaved visual display system is explained; the major
subsystems are described, with principal emphasis placed on the visual display
system, terrain model, and helmet sight system. Engineering data gathered to
evaluate this area of interest (AOI) display are included, along with details
of the drive logic and a summary of pilot comments evaluating the ability of
the visual display system to provide the necessary requirements to perform
the air-to-ground mission.


I. INTRODUCTION


The  LAMARS  engineering  simulator  is  currently  in  operation  at  the  AF 
Flight  Dynamics  Laboratory.  This  high  performance  simulator  will  serve  as 
one of the major engineering tools in the new ground-based simulation facility,
the Flight Control Development Laboratory. This facility is the focal
point for coordinated studies encompassing pilot/vehicle studies, ground
based and in-flight simulation as applied to flight dynamic research, prototype
evaluation and weapon system development. The LAMARS will be used in
support  of  programs  covering  advanced  development,  system  development  and 
integration,  and  simulation  support. 


The  combination  of  wide  angle  visual  cues  and  large  amplitude  motion 
cues  that  the  simulator  gives  to  the  pilot  provides  an  environment  which  will 
evoke  proper  pilot  response  characteristics  for  a  large  number  of  mission 
conditions  and  pilot  control  tasks.  Knowledge  of  how  a  pilot  will  react  to 
various conditions is required during preliminary design and preflight evaluation
of military aircraft designs. Such prior knowledge will identify possible
trade-offs between aircraft performance, stability and control, and maneuverability.
The engineering flight simulator gives the Air Force pilot the
unique opportunity to significantly influence the design of a flight vehicle
long before the final configuration is determined.

Though not its principal mission, the Flight Control Division of the AF
Flight Dynamics Laboratory has the engineering knowledge and first hand hardware
experience to help make determinations of simulator criteria and requirements.
Recently, the Simulator System Program Office of the Aeronautical
Systems Division, Wright-Patterson AFB began work on testing present Air
Force simulators for the purpose of developing visual requirements for
air-to-ground mission simulators. This study was to determine both qualitatively
and quantitatively the cues which could be generated visually by the LAMARS
simulator at the AF Flight Dynamics Laboratory, Wright-Patterson AFB, Dayton,
Ohio, the Advanced Simulator for Undergraduate Pilot Training (ASUPT) at
Williams AFB, Chandler, Arizona, and the Simulator for Air-to-Air Combat
(SAAC) at Luke AFB, Glendale, Arizona. By evaluating these three methods of
visual projection, one can better determine which methods will best provide
the needed visual cues to perform the air-to-ground tactical mission.

It  should  be  noted  that  this  program  was  conducted  to  show  what  present 
Air  Force  simulators  could  provide  in  the  way  of  air-to-ground  visual  cues. 
The  intent  was  not  to  update  these  simulators  with  the  most  current  hardware 
available  but  to  use  what  hardware  was  in  the  simulators  at  the  time,  note 
any  limitations  which  could  be  eliminated  or  improved  by  recent  hardware 
developments  and  substantiate  these  comments  with  engineering  data.  It 
should also be pointed out that the LAMARS simulator was designed primarily
for air-to-air combat mission evaluations and handling quality studies. This
simulator is not a training simulator but an engineering design and evaluation
tool obtained primarily for handling qualities evaluation and air-to-air
combat  studies. 


II.  DESCRIPTION  OF  THE  MAJOR  SIMULATOR  SUBSYSTEMS 
Visual  Display  System 

The  visual  display  system  of  the  LAMARS  consists  of  a  20-foot  diameter 
spherical  projection  screen  fixed  to  the  cockpit.  Within  the  sphere  are  two 
projectors  which  display  the  visual  information  to  the  pilot  (Figure  1).  One 
projector  mounted  above  and  to  the  rear  of  the  cockpit  projects  the  high 
resolution detailed terrain image while the other projector, located above
and slightly to the rear of the target projector, referred to as the sky/earth
projector, projects a passive blue sky, green earth horizon display throughout
the entire sphere.

As  was  mentioned  previously,  the  pilot  is  at  the  center  of  the  spherical 
screen,  which  helps  eliminate  distortion  of  the  projected  images.  As  a 
result,  the  target  projector  and  the  sky-earth  projector  are  off  center  and 
the  projectors  themselves  have  been  specially  designed  to  compensate  for  this 
effect. The target projector has a focus servo which moves the cathode-ray
tube (CRT) in and out with respect to the projection lens. By knowing the
Figure 1

Relationship  of  Projection  Systems 


Figure  2 
position of the projector with respect to the center of the sphere and also
the azimuth and elevation of the projector, the "throw distance" or focal
length from the projector to the screen can be calculated and used to drive
the focus servo. The target projector also has a "keystone" correction provision
in its raster control circuitry to compensate for the varying angles
at which the projected image impinges on the surface of the screen. The
sky/earth projector has a mechanism inside the twin hemispherical transparent
domes which can position the point light source of the "sky" lamp and "earth"
lamp to the proper x, y, and z coordinates within the transparency so that
the projected horizon line is always correctly oriented on the screen without
distortion. This is demonstrated in Figure 2.
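
The throw distance calculation described above amounts to intersecting the projector's line of sight with the spherical screen. The following is a minimal sketch, assuming a sphere of 10-foot radius (from the 20-foot diameter) centered at the origin, axes with x forward, y to the right, and z up, and azimuth and elevation measured from the x-axis. The axis convention, the function name `throw_distance`, and the projector position used in the example are assumptions for illustration; the actual LAMARS drive equations are not given in the text.

```python
from math import cos, radians, sin, sqrt


def throw_distance(projector_pos, azimuth_deg, elevation_deg, radius=10.0):
    """Distance in feet from the projector to the screen along its line of sight.

    projector_pos: (x, y, z) of the projector relative to the sphere center.
    """
    az, el = radians(azimuth_deg), radians(elevation_deg)
    # Unit vector along the projector's pointing direction.
    d = (cos(el) * cos(az), cos(el) * sin(az), sin(el))
    p_dot_d = sum(p * c for p, c in zip(projector_pos, d))
    p_sq = sum(p * p for p in projector_pos)
    # Positive root of |p + t d|^2 = radius^2 for a point inside the sphere.
    return -p_dot_d + sqrt(p_dot_d ** 2 - p_sq + radius ** 2)


# A projector at the center always throws exactly one radius.
print(throw_distance((0.0, 0.0, 0.0), 45.0, 20.0))
# Hypothetical projector 2 ft behind the center, pointing straight ahead.
print(throw_distance((-2.0, 0.0, 0.0), 0.0, 0.0))
```

The off-center case shows why the focus servo must be driven continuously: the throw distance changes with pointing angle whenever the projector is not at the center of the sphere.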


Terrain  Model/Computer  Image  Generation 

The  image  which  is  presented  to  the  pilot  is  generated  by  a  large  scale 
model  of  a  typical  terrain  area.  This  three  dimensional  full  color  terrain 
model  has  its  own  gantry  supported,  optical  probe  equipped  television  camera 
positioned  by  precision  servo  drive  systems  commanded  by  computer  signals. 
These  computer  signals  are  a  result  of  the  aircraft  orientation  and  provide 
properly oriented visual cues to the pilot. Also, computer generated images
(CGI) can be programmed and superimposed over the terrain image to provide
moving targets, Forward Air Controller (FAC) aircraft, computer generated
heads-up displays, etc.; however, their utility using a television display
system is questionable (see Appendix I). The resultant video picture is
transmitted to the cathode-ray tube of the target projector and is projected
onto the spherical screen. The sky/earth horizon display and the horizon in
the video picture are synchronized by superposition of the terrain image and
sky/earth image on the dome. This provides a 60° diagonal forward field-of-view
displaying high resolution terrain imagery and also a wide field of view
horizon display to produce the necessary peripheral cues needed to help the
pilot in judging sink rate, roll position, roll rates, etc.


Projection Capability

The  requirement  for  a  60°  diagonal  area-of-interest  projection  beam  which 
can be positioned anywhere within the field-of-view of the pilot arose from
the desire to have a display capability which could provide the visual
requirements for such tasks as air-to-ground weapon delivery, terrain
following/avoidance, and take off/landing work. The resulting display capability
represents a compromise between a full field of view terrain image and what
was  believed  to  be  a  realistic  achievement  based  on  a  thorough  review  of 
existing  hardware. 

The  requirement  of  a  large  field  of  view  results  from  the  new  aircraft 
designs,  which  to  properly  simulate,  require  an  unlimited  view  in  both 
azimuth  and  elevation.  The  LAMARS  simulator  was  able  to  achieve  a  266°  field 
of  view  in  azimuth  and  a  108°  field  of  view  in  elevation.  These  values  were 
not quite as large as was desired (280° in azimuth, 150° in elevation) but
represent the largest field of view possible, limited only by the cockpit
gimbal system and the projector system mounting configuration.


III.  THE  AIR  TO  GROUND  WEAPONS  DELIVERY  TASK 


In  performing  an  air-to-ground  task  in  the  LAMARS  there  are  three  methods 
in which to display the high resolution terrain image. The first, and simplest,
is to fix the 60° area of interest to the x-axis of the aircraft. In
doing this, the rectangular AOI will remain fixed in the front window of
the cockpit. The picture within the field of view will change solely as a
result of the aircraft orientation. This provides a forward fixed scene but
does not permit the pilot to perform his typical roll maneuver when attacking
a ground target since the target at that point will be out of the area of interest.
Secondly, it does not utilize the large field-of-view capability of
the spherical projection screen.


The second approach would be to fix the visual scene to a specific target.
This would permit the 60° diagonal terrain image to appear anywhere in
the dome based on the relative positions of the aircraft and the target.
For example, if the pilot was flying into the target the terrain image would
be directly in front of the pilot. If the pilot flew by the target, the
terrain image would move off to the side and on back to the rear of the dome
and finally disappear from view. This is certainly more realistic than the
first case but does not permit the pilot to do terrain following and target
search since he knows that if he keeps the terrain image directly in front
of him he will eventually fly directly over the target. To help eliminate
the problems associated with the above methods, an area-of-interest,
head-slaved visual display system was developed.


Head  Slaved  Visual  System 

To  develop  a  head-slaved  visual  system,  one  must  first  be  able  to  monitor 
the pilot's head position. This should be done in a manner which is completely
unobtrusive to the pilot. Once the head position is known and calibrated
with respect to a fixed reference point in the cockpit, then drive
logic  and  its  associated  software  can  be  developed. 

The  hardware  used  in  this  experiment  was  a  state-of-the-art  helmet  sight 
system.  Sensor  surveying  units  were  rigidly  fixed  to  the  cockpit  and  aimed 
in  the  direction  of  the  pilot's  head.  These  units  emit  fan-like  beams  of 
infrared  light  rotating  at  a  constant  velocity.  The  infrared  beams  sweep 
over  a  reference  photo  sensor  and  two  pairs  of  helmet  mounted  photo  sensors, 
one  pair  on  each  side  of  the  helmet.  The  time  intervals  between  the  pulses 
from  the  helmet  photo  sensors  and  the  reference  pulse  are  a  measure  of  the 
pilot's  head  orientation  relative  to  the  body  axis  of  the  aircraft.  The 
outputs  of  the  photo  sensors  are  transmitted  to  the  helmet  sight  computer 
where  the  angular  computations  are  performed  and  converted  into  azimuth  and 
elevation  information.  The  accuracy  of  the  system  is  one-half  degree  CEP. 
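
Because the infrared beams rotate at constant velocity, the time interval between the reference pulse and a helmet sensor pulse maps linearly to an angle. The following is a minimal sketch of that relation only, assuming a hypothetical beam rotation rate; the text gives neither the rotation rate nor the geometry by which the helmet sight computer combines the sensor pairs into azimuth and elevation, so the name `sweep_angle_deg` and the numbers below are illustrative.

```python
def sweep_angle_deg(dt_seconds, rev_per_second):
    """Angle swept by the rotating beam between the reference pulse
    and a helmet sensor pulse.

    rev_per_second is a hypothetical beam rotation rate, not a value
    taken from the helmet sight system described in the text.
    """
    degrees_per_second = 360.0 * rev_per_second
    return degrees_per_second * dt_seconds


# With an assumed 50 rev/s beam, a 2.5 ms interval corresponds to 45 degrees.
print(sweep_angle_deg(0.0025, 50.0))
```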

As  was  shown  before,  the  target  projector  and  the  pilot's  head  are  at 
different locations within the sphere. As a result, to point the center
of  the  image  projected  by  the  target  projector  to  intercept  the  point 
directly  in  front  of  the  pilot’s  head,  a  geometrical  transformation  is 
required. 

At  the  same  time,  it  requires  that  the  distance  from  the  target  projector 
lens to the sphere be calculated to drive the focus servo. Because the target
projector probe is not as responsive as the angular position servos of
the visual probe, the angles of the head are then recalculated using a similar
geometric transformation and the position feedback signals from the target
projector. The calculated head position and the aircraft position are
combined to form a resultant drive signal to the visual probe servos. The
probe servos are lead compensated to provide better response. This arrangement
permits the pilot to look about within the sphere and view a correctly
oriented  visual  segment. 
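
The geometrical transformation from head angles to projector angles can be sketched as follows. The sketch assumes the pilot's head is at the center of a 10-foot radius sphere, axes with x forward, y to the right, and z up, and an off-center projector at a known position; these conventions, the function name `projector_command`, and the example projector position are assumptions for illustration, not the LAMARS drive logic itself.

```python
from math import atan2, cos, degrees, radians, sin, sqrt


def projector_command(head_az_deg, head_el_deg, projector_pos, radius=10.0):
    """Projector azimuth, elevation (deg) and throw distance (ft) needed to
    center the image on the point directly in front of the pilot's head."""
    az, el = radians(head_az_deg), radians(head_el_deg)
    # Point on the screen along the pilot's line of sight from the center.
    aim = (radius * cos(el) * cos(az),
           radius * cos(el) * sin(az),
           radius * sin(el))
    # Vector from the off-center projector to that point.
    vx, vy, vz = (a - p for a, p in zip(aim, projector_pos))
    horizontal = sqrt(vx * vx + vy * vy)
    proj_az = degrees(atan2(vy, vx))
    proj_el = degrees(atan2(vz, horizontal))
    throw = sqrt(vx * vx + vy * vy + vz * vz)
    return proj_az, proj_el, throw


# Hypothetical projector 1 ft behind and 2 ft above the pilot's head,
# pilot looking straight ahead.
print(projector_command(0.0, 0.0, (-1.0, 0.0, 2.0)))
```

The same relations, run in the opposite direction with the projector's position feedback, would give the recalculated head angles mentioned above.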

Other software has also been added to help make the visual image seen by
the pilot more realistic. One of these is a dead zone filter which is used
to help keep small amplitude, high frequency head movements from placing
jitter into the system. The dead zone requires that the pilot move his
head some predetermined delta value before the target projector repositions
itself. This allows the pilot to slowly scan the terrain image without
constantly repositioning the projector. It also acts to keep out high frequency
noise which could result from calculation errors in the digital
machine. 


IV.  EVALUATION  TEST  PLAN 

The  LAMARS  air-to-ground  visual  system  was  evaluated  by  six  pilots  from 
the  Tactical  Air  Command,  Aeronautical  Systems  Division  and  the  Navy.  These 
pilots  were  currently  rated  in  various  high  performance  aircraft  and  were 
highly  experienced  in  the  air-to-ground  task. 

Each  pilot  flew  the  device  for  a  total  of  eight  one-hour  sorties.  The 
pilots  performed  the  following  weapon  deliveries: 


1. 10° skip bomb

2. 15° low angle bomb

3. 15° low angle strafe

4. 20° low angle dive bomb

5. 30° high angle strafe

6. 30° dive bomb

7. 45° dive bomb

At weapon release the following data were recorded:

1. aircraft position, velocity, angles, angular velocities

2. weapon miss distance (bomb) or mean impact point (gun)

A  number  of  tactical  targets  were  used  during  the  sorties. 

1.  range  target 

2.  gravel  pit 

3.  suspension  bridge 

4.  dam  pumphouse 

5.  railroad  overpass 

6.  small  house 

7.  runway  lights 

8.  highway  intersection 

9.  oil  tank 

10.  SAM  site 

11.  parked  aircraft 

12.  bulldozer 

13.  airport  terminal 

The  aircraft  could  deliver  either  a  MK82  bomb  or  deliver  gunfire  with  a 
30mm  gun. 

A  pilot  questionnaire  was  used  to  gather  information  on: 

1.  opinion  of  weapon  delivery  performance 

2.  ease  of  target  acquisition 

3.  comments  on  AOI  display 

4.  comments  on  simulation  realism 

5. other pertinent comments

The following numerical data were computed for each pilot/target/delivery
condition:

1.  mean  along  track  miss  distance 

2.  standard  deviation,  along  track  miss  distance 

3.  mean  cross  track  miss  distance 

4.  standard  deviation,  cross  track  miss  distance 

5.  correlation  coefficient,  along  track  and  cross  track  miss  distance 

6.  median  radial  error 

7.  maximum  miss  distance 

8. minimum miss distance

9.  the  number  of  hits  long,  short,  left,  and  right  of  target. 
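
The statistics listed above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `miss_statistics`, the sign convention (positive along-track = long, positive cross-track = right), and the sample numbers in the usage note are hypothetical and are not taken from the evaluation data.

```python
import math
import statistics


def miss_statistics(along, cross):
    """Summarize bomb miss distances (ft) for one pilot/target/delivery condition.

    along: along-track misses, positive = long (assumed convention)
    cross: cross-track misses, positive = right (assumed convention)
    """
    radial = [math.hypot(a, c) for a, c in zip(along, cross)]
    mean_a, mean_c = statistics.mean(along), statistics.mean(cross)
    sd_a, sd_c = statistics.stdev(along), statistics.stdev(cross)
    # Sample covariance, used for the along/cross correlation coefficient.
    cov = sum((a - mean_a) * (c - mean_c) for a, c in zip(along, cross)) / (len(along) - 1)
    return {
        "mean_along": mean_a, "sd_along": sd_a,
        "mean_cross": mean_c, "sd_cross": sd_c,
        "correlation": cov / (sd_a * sd_c),
        "median_radial_error": statistics.median(radial),
        "max_miss": max(radial), "min_miss": min(radial),
        "long": sum(a > 0 for a in along), "short": sum(a < 0 for a in along),
        "right": sum(c > 0 for c in cross), "left": sum(c < 0 for c in cross),
    }
```

For example, `miss_statistics([30, -40, 0, 60], [40, 30, -10, -80])` summarizes four hypothetical releases whose radial misses are 50, 50, 10 and 100 ft.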

Engineering  tests  were  also  conducted  on  the  LAMARS  visual  system  itself. 
These  tests  included 

1.  System  Static  Resolution/Modulation  Transfer  Function 

2.  Image  Generator  Static  Resolution/Modulation  Transfer  Function 

3. Display Static Resolution/Modulation Transfer Function

4.  Image  Generator  Dynamic  Resolution/Modulation  Transfer  Function 

5.  Display  Brightness 

6.  Display  Grayscale 

7.  Display  Contrast 

8.  Display  Shading 

9.  System  Brightness 

10.  System  Grayscale 

11.  System  Contrast 

12.  System  Shading 

13.  System  Geometric  Distortion 

14. AOI Field-of-View

15. AOI Dynamic Envelope Size

16. AOI Edge Transition Quality

17. Target Image Location Dynamic Lag

18.  System  Rate  Accuracy 

19.  System  Design  Data 


V.  PILOT  EVALUATION  RESULTS 

The pilots were debriefed at the end of each 60 minute sortie and were
asked to evaluate the task with respect to the LAMARS head-slaved visual
system. The following comments, with their causes, were common to all
pilots.


Attributes 

1.  Diversified  Scene  Content  -  The  model  board  type  of  image  generation 
is  unique  in  its  ability  to  produce  high  quality  scene  detail.  The  detail 
has  good  texture  and  is  significantly  more  detailed  than  computer  generated 
imagery. 

2. Smooth Projection Surface - The spherical dome does not have discrete
display surfaces as do CRT type displays. Because of this the scene can
transition throughout the 266° x 108° viewing area with smooth continuity.

3. Head Slaved AOI is Promising - The pilots all felt that the head
slaved area-of-interest display was the best of the three projection methods
within the LAMARS but felt it was presently limited by other hardware
shortcomings of the rigid model visual system.


Limitations 

1. AOI Too Small - The rigid model visual system presently is limited
to a 60° diagonal area of interest (48° wide by 36° high). This is inadequate
for many tactical maneuvers. For example, to attack a ground target or to
turn from base leg to final during landing requires a minimum of 90° field
of view. A navigational landmark is used to guide the aircraft, and when
the target or runway appears over the left wing a turn or roll maneuver is
executed. With a 48° wide area of interest, the target and navigational aid
are not within the same visual segment. In the head slave mode, the pilot
can command the probe/target projector to pick up the visual cues he needs
to perform the task. The pilot, however, is now placed in a situation where
he is performing unrealistic head movements, which degrades his performance.
A minimum of 120° diagonal (96° wide by 72° high) area of interest is required
to perform a realistic tactical maneuver using a head slaved visual system.

2. 47° Pitch Restriction Too Small - The AFFDL rigid model visual
system has a 47° down pitch hardware limit on the visual probe. When this
limit is exceeded the visual segment is obscured by a pseudo-cloud cover.
In high angle tactical maneuvers, since the head and aircraft now command
the probe's pitch position, these limits can be easily exceeded. It is quite
distracting and unnatural to lose the target near weapon release. An
unlimited pitch axis on the probe is an "off-the-shelf" modification which
can be easily performed on the AFFDL system.

3. Uncertain Scoring Accuracy - Scoring accuracy using head-slaving
is presently uncertain because it was not possible to calibrate it prior to
pilot evaluation. Weapon delivery data, at first look, appears to indicate
that there is a dynamic lag using head slaving which could cause impact
errors on the order of several hundred feet given the approach speeds
and dive angles used in this simulation.

4. Horizon Mismatch - The sky/earth horizon display did not always
match the horizon display from the rigid model terrain board image. There
are two reasons for this. First, the two were driven from separate loops:
one was programmed for flat earth (terrain board image) while the other was
for spherical earth (sky/earth display). Second, the sky-plate used by the
probe to prevent viewing the probe/gantry in the mirrors can displace the
apparent horizon location at high altitude or near the mirrors surrounding
the board. Further programming improvements will alleviate this problem.

The pilots in general found that the combination of the small field-of-view
and the pitch restriction on the probe produced unrealistic piloting
conditions. They did feel that if these two problems were alleviated or
eliminated, a viable air-to-ground display could result.

APPENDIX I

TELEVISION  DISPLAY  RESOLUTION 

When a television display is used, one must keep in mind that each
visual picture is obtained from a series of raster lines painted by a cathode
ray gun. The more lines a display is able to paint across the face of the
screen, the more accurate will be the resulting picture. Using a typical
60° field of view probe and a television display whose resolution is 700
effective lines, one can determine at what distance a target of known
dimensions can be recognized.

If

N = number of effective resolved lines across the sensor's FOV
M = number of lines across the target for identification/recognition
L = maximum dimension of target
S = slant range

then using

$$\text{probe FOV required} = \frac{2N}{M}\tan^{-1}\left(\frac{L}{2S}\right) \qquad \text{(I-1)}$$

if 8 lines are required for recognition of a moving target whose maximum
dimension is 30 feet, the effective resolution of the display is 700 lines,
and the probe FOV is 60°, then

$$S = \frac{L}{2\tan\left\{\dfrac{(\text{probe FOV})\,M}{2N}\right\}} \approx 2500\ \text{ft} \approx 1/2\ \text{mile}. \qquad \text{(I-2)}$$


It  is  obvious  that  in  unlimited  visibility,  a  30  ft  aircraft  can  be  seen  for 
several  miles.  Using  the  same  information  as  above,  a  30  ft  airplane  at  two 
miles  would  be  painted  by  less  than  two  scan  lines  resulting  in  only  a 
single  dot  on  the  screen. 
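
The arithmetic in this appendix can be checked with a short Python sketch. The function names are illustrative, and a mile is assumed to be a statute mile of 5280 ft (the appendix does not say which mile is meant).

```python
import math


def recognition_slant_range(target_size_ft, lines_required, effective_lines, probe_fov_deg):
    """Slant range S (ft) at which the target spans `lines_required` raster lines (eq. I-2)."""
    half_angle = math.radians(probe_fov_deg * lines_required / (2.0 * effective_lines))
    return target_size_ft / (2.0 * math.tan(half_angle))


def lines_on_target(target_size_ft, slant_range_ft, effective_lines, probe_fov_deg):
    """Raster lines M painted across the target (eq. I-1 solved for M)."""
    subtended_deg = 2.0 * math.degrees(math.atan(target_size_ft / (2.0 * slant_range_ft)))
    return effective_lines * subtended_deg / probe_fov_deg


# 30 ft target, 8 lines for recognition, 700 effective lines, 60 degree probe FOV.
s = recognition_slant_range(30.0, 8, 700, 60.0)
# Same target at two (statute) miles.
m = lines_on_target(30.0, 2 * 5280.0, 700, 60.0)
```

With these inputs `s` comes out close to 2500 ft and `m` is a little under two lines, in agreement with the figures quoted in the text.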


APPENDIX  II 
SOFTWARE  EQUATIONS 

The  block  diagram  for  the  head  slaved  visual  system  is  shown  in  Figure  3. 
In  this  appendix,  the  important  software  equations  are  described. 

A.  Head  to  Target  Projector  Transformation 

Azimuth

$\psi^* = 180.0 - \psi_H$

$R_\psi^2 = [10.0]^2 + [2.0]^2 - 2(10.0)(2.0)\cos\psi^*$

Therefore

$R_\psi = [104.0 - 40.0\cos(180.0 - \psi_H)]^{1/2}$   (II-A-1)

Elevation

$B = \cos^{-1}\left[\dfrac{R_\eta^2 + (2.9486)^2 - (10.0)^2}{2(2.9486)R_\eta}\right]$

which reduces to

$B = \cos^{-1}\left[\dfrac{R_\eta^2 - 91.3055}{5.8973\,R_\eta}\right]$

and

$\eta_{TP} = \cos^{-1}\left[\dfrac{R_\eta^2 - 91.3055}{5.8973\,R_\eta}\right] - 47.29°$   (II-A-4)

Using the body axis coordinates x, y, z, the location of the head with
respect to the target projector is

$x_H = 2.0$

$y_H = 0$

$z_H = -2.1666$

The components of the vector from the head to the screen can be given by

$x_P = R\cos\psi_H\cos\eta_H$   (II-A-5)

$y_P = R\sin\psi_H\cos\eta_H$   (II-A-6)

$z_P = R\sin\eta_H$   (II-A-7)

The throw distance to the screen is then

$R_f = [(x_H + x_P)^2 + (y_H + y_P)^2 + (z_H + z_P)^2]^{1/2}$   (II-A-8)

B.  Calculated  Head  Angles  from  Target  Projector  Feedbacks 

To determine the calculated head angles based on the position feedback
signals from the target projector, $\psi_{TP}$ and $\eta_{TP}$ are replaced
with $\psi_{TPFB}$ and $\eta_{TPFB}$, respectively, in equations II-A-1
through II-A-4. The results are

$R_{\psi CAL} = 2.0\cos\psi_{TPFB} + 0.5[392.0 + 8.0\cos 2\psi_{TPFB}]^{1/2}$   (II-B-1)

$\psi_{HCAL} = \cos^{-1}\left[\dfrac{R_{\psi CAL}^2 - 104.0}{40.0}\right]$   (II-B-2)

$R_{\eta CAL} = 2.9486\cos(\eta_{TPFB} + 47.29°) + 0.5\{17.39[1 + \cos(2\eta_{TPFB} + 94.58°)] + 365.22\}^{1/2}$   (II-B-3)

$\eta_{HCAL} = 132.71° - \cos^{-1}\left[\dfrac{108.6944 - R_{\eta CAL}^2}{58.9727}\right]$   (II-B-4)
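
The azimuth pair II-B-1 and II-B-2 can be sketched in Python as follows. The function names are illustrative, and the reading of the feedback azimuth as the projector-side angle of a triangle with sides 10.0 (head to screen), 2.0 (head to projector) and R (projector to screen) is an interpretation of the constants in II-A-1, not something stated explicitly in this appendix.

```python
import math


def throw_from_feedback(psi_tpfb_deg):
    """Eq. II-B-1: projector-to-screen distance for a feedback azimuth (deg)."""
    psi = math.radians(psi_tpfb_deg)
    return 2.0 * math.cos(psi) + 0.5 * math.sqrt(392.0 + 8.0 * math.cos(2.0 * psi))


def head_azimuth_from_throw(r):
    """Eq. II-B-2: calculated head azimuth (deg) from the distance above."""
    c = (r * r - 104.0) / 40.0
    # Clamp against round-off just outside [-1, 1] before taking the arc cosine.
    return math.degrees(math.acos(max(-1.0, min(1.0, c))))


def calculated_head_azimuth(psi_tpfb_deg):
    """Chain II-B-1 and II-B-2: feedback azimuth (deg) to calculated head azimuth (deg)."""
    return head_azimuth_from_throw(throw_from_feedback(psi_tpfb_deg))
```

Looking straight ahead, `throw_from_feedback(0.0)` gives 12.0 (the 10.0 radius plus the 2.0 offset) and the calculated head azimuth is zero, which is the expected limiting case.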


C.  Axis  Transformations 

Given a set of earth fixed axes, tangent to the surface of the earth,
z axis down, and the aircraft Euler angles (body axis, $\phi$, $\theta$,
$\psi$) and the head angles with respect to the body axis ($\eta_H$,
$\psi_H$, $\phi_H = 0$), the various axis transformation matrices are
desired. The transformation matrix, $C_E^B$, from earth axis to body axis
is given in terms of the aircraft Euler angles, $\phi$, $\theta$, $\psi$, by

$$C_E^B = \begin{bmatrix}
c\theta\,c\psi & c\theta\,s\psi & -s\theta \\
s\phi\,s\theta\,c\psi - c\phi\,s\psi & s\phi\,s\theta\,s\psi + c\phi\,c\psi & s\phi\,c\theta \\
c\phi\,s\theta\,c\psi + s\phi\,s\psi & c\phi\,s\theta\,s\psi - s\phi\,c\psi & c\phi\,c\theta
\end{bmatrix} \qquad \text{(II-C-1)}$$

where c denotes cosine and s denotes sine. The transformation matrix,
$C_B^H$, from body axis to head axis, is given by
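
A minimal Python sketch of these axis transformations follows. The earth-to-body matrix is coded directly from equation II-C-1. The body-to-head matrix is not reproduced in this copy, so the sketch assumes it has the same form with the head angles substituted (head elevation in place of pitch, head azimuth in place of yaw, zero head roll); that assumption, and all function names, are illustrative only.

```python
import math


def euler_matrix(phi, theta, psi):
    """Direction cosine matrix of eq. II-C-1 for roll phi, pitch theta, yaw psi (radians)."""
    cf, sf = math.cos(phi), math.sin(phi)
    ct, st = math.cos(theta), math.sin(theta)
    cp, sp = math.cos(psi), math.sin(psi)
    return [
        [ct * cp, ct * sp, -st],
        [sf * st * cp - cf * sp, sf * st * sp + cf * cp, sf * ct],
        [cf * st * cp + sf * sp, cf * st * sp - sf * cp, cf * ct],
    ]


def matmul(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)] for i in range(3)]


def earth_to_head(phi, theta, psi, eta_h, psi_h):
    """Earth-to-head matrix as (body-to-head) times (earth-to-body).

    ASSUMPTION: body-to-head reuses the II-C-1 form with phi_H = 0.
    """
    c_earth_to_body = euler_matrix(phi, theta, psi)
    c_body_to_head = euler_matrix(0.0, eta_h, psi_h)
    return matmul(c_body_to_head, c_earth_to_body)
```

As a sanity check on the convention, a pure aircraft yaw of 0.3 rad combined with a pure head azimuth of 0.2 rad gives the same matrix as a single yaw of 0.5 rad.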

D. Dead Zone Logic

This is a nonlinear filter realized by two logic tests which allows
the target projector to be moved only by head position. Random small
amplitude noise is filtered out, permitting a smooth projector response.
The position and rates of the head can be calculated using the following
equations.

Elevation position limit = ABS($\eta_H$(new) - $\eta_H$(old))   (II-D-1)

Azimuth position limit = ABS($\psi_H$(new) - $\psi_H$(old))   (II-D-2)

Elevation rate limit = ABS($\eta_H$(new) - $\eta_{HCAL}$(old))   (II-D-3)

Azimuth rate limit = ABS($\psi_H$(new) - $\psi_{HCAL}$(old))   (II-D-4)

Using a 25 sample/second update rate, the required position and rate
limits can be calculated. As long as the pilot does not exceed either the
position or rate limits, the target projector will remain fixed. As
soon as either the rate limit or the position limit is exceeded, the target
projector will move to a new position based on the commands from the head
sensing unit. This allows small head motions to be made without affecting
the target projector and also helps keep noise on the signals from
driving the target projector.
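
The two-test logic can be sketched for a single axis as below. This is one possible reading, not the LAMARS code: the class name, the choice to measure the position test against the angle at which the projector is currently held, and the limit values used in the usage note are all assumptions. Only the 25 sample/second rate comes from the text.

```python
class DeadZone:
    """Single-axis dead zone: hold the projector command until a limit is exceeded."""

    def __init__(self, position_limit, rate_limit, sample_rate=25.0):
        self.position_limit = position_limit  # deg (hypothetical value chosen by caller)
        self.rate_limit = rate_limit          # deg/sec (hypothetical value chosen by caller)
        self.dt = 1.0 / sample_rate           # 25 samples/sec per the text
        self.held = None                      # angle the projector is currently held at
        self.previous = None                  # previous head-angle sample

    def update(self, angle):
        """Feed one head-angle sample (deg); return the projector command (deg)."""
        if self.held is None:
            self.held = self.previous = angle
            return self.held
        moved = abs(angle - self.held) > self.position_limit
        fast = abs(angle - self.previous) / self.dt > self.rate_limit
        self.previous = angle
        if moved or fast:
            # Either logic test tripped: reposition to the new head angle.
            self.held = angle
        return self.held
```

With a 1.0 degree position limit and a 50 deg/sec rate limit, feeding the samples 0.0, 0.3, 0.6, 0.9, 1.2 holds the command at 0.0 until the last sample, where it jumps to 1.2.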


E.  Visual  Probe  Drive  Equations 

The visual probe is also required to be slaved to the pilot's head
(line of sight). The visual probe orientation is determined by the
transformation matrix from earth axis to head axis, where $C_E^H$,
$\theta_{EQ}$, $\phi_{EQ}$ and $\psi_{EQ}$ are calculated as in section C.

The probe compensation required was based on the target projector
transfer function of

$$H(s) = \frac{K}{s^2 + 2(.7)(25)s + (25)^2}.$$

The probe was compensated to have a transfer function of

$$H(s) = \frac{K(s + 55)}{s^2 + 2(.7)(25)s + (25)^2}$$

with the compensated probe having a 6% overshoot and a 6 ms lag. The
compensation was performed in the digital computer using the equation

$$\mathrm{DRIVE} = \frac{\mathrm{ANGLE}_i - \mathrm{ANGLE}_{i-1}}{\Delta T}\cdot[\ldots] + \mathrm{ANGLE}_i$$

The angles were nondimensionalized and $\Delta T$ was about 16 ms.
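
A digital lead term of this kind, the current angle plus a scaled backward-difference rate, can be sketched in Python as follows. The coefficient multiplying the rate term is assumed here to be 1/55 sec, chosen only because it corresponds to a zero at s = -55; that value and the function name are assumptions for illustration and are not taken from the paper.

```python
def lead_compensated_drive(angles, dt=0.016, lead_time=1.0 / 55.0):
    """Return drive samples: each angle plus lead_time times its backward-difference rate.

    dt: sample interval in seconds (about 16 ms per the text)
    lead_time: ASSUMED rate coefficient in seconds (zero at s = -55)
    """
    drive = []
    previous = angles[0]  # the first sample has no predecessor, so its rate is taken as zero
    for angle in angles:
        rate = (angle - previous) / dt
        drive.append(angle + lead_time * rate)
        previous = angle
    return drive
```

A constant input passes through unchanged, while a steady ramp is advanced by `lead_time` times its slope, which is the intended effect of lead compensation on a lagging servo.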


IN-FLIGHT SIMULATION STUDY OF DECOUPLED
LONGITUDINAL  CONTROLS  FOR  THE  APPROACH 
AND  LANDING  OF  A  STOL  AIRPLANE1 
By  Edward  Seckel  and  Benjamin  Feinreich 
Princeton  University 

SUMMARY 


A simulation study of a powered lift STOL transport having decoupled
longitudinal control for the approach and landing flight phases has been
conducted on the Princeton Navion in-flight simulator. In the decoupled
control concept, the natural interacting airplane responses (combined pitch
attitude, speed, and flight path angle changes for fore and aft stick motion,
for example) are suppressed, and the pilot operates a separate control
lever for each variable.

In  this  study,  fore  and  aft  control  column  motion  produced  changes  in 
flight  path  angle  without  changing  attitude  or  speed;  the  throttle  commanded 
speed  changes  independent  of  attitude  or  flight  path  angle;  and  the  pitch  trim 
wheel  allowed  independent  pitch  attitude  changes. 

Landings were made out of various typical STOL straight and segmented
approaches using ILS and precision optical guidance. The flying
qualities were judged to be very favorable, although a short period of
adjustment to the unconventional constant-attitude, constant-speed flare was
required. The precise control over flight path resulted in small touchdown
point dispersion along with consistently low sink rates.


SYMBOLS 


$C_L$   lift coefficient

$C_{L_\infty}$   lift coefficient out of ground effect

$d\gamma/du$   change of flight path with speed, thrust constant, deg/kt

1 This study was performed for NASA, Langley Research Center, under
Contract NAS1-13502.

$g$   acceleration due to gravity, m/sec², ft/sec²
$h$   altitude, m, ft
$\dot h$   vertical velocity, m/sec, ft/sec
$M_u$   pitch acceleration derivative due to speed, $\frac{1}{I_y}\frac{\partial M}{\partial u}$, rad/sec² per m/sec, rad/sec² per ft/sec
$M_{u_c}$   pitch acceleration derivative due to forward speed command input, $\frac{1}{I_y}\frac{\partial M}{\partial u_c}$, rad/sec² per m/sec, rad/sec² per ft/sec
$M_\alpha$   pitch acceleration derivative due to angle of attack, $\frac{1}{I_y}\frac{\partial M}{\partial \alpha}$, rad/sec² per rad
$M_{\dot\alpha}$   pitch acceleration derivative due to rate of change of angle of attack, $\frac{1}{I_y}\frac{\partial M}{\partial \dot\alpha}$, rad/sec² per rad/sec
$M_\gamma$   pitch acceleration derivative due to flight path angle (used when the equations of motion are written in u, $\gamma$, $\theta$ rather than the more common u, $\alpha$, $\theta$; for unaugmented airplane $M_\gamma = -M_\alpha$), $\frac{1}{I_y}\frac{\partial M}{\partial \gamma}$, rad/sec² per rad
$M_{\dot\gamma}$   pitch acceleration derivative due to rate of change of flight path angle, $\frac{1}{I_y}\frac{\partial M}{\partial \dot\gamma}$, rad/sec² per rad/sec
$M_{\gamma_c}$   pitch acceleration derivative due to flight path command input, $\frac{1}{I_y}\frac{\partial M}{\partial \gamma_c}$, rad/sec² per rad
$M_\theta$   pitch acceleration derivative due to pitch attitude, $\frac{1}{I_y}\frac{\partial M}{\partial \theta}$, rad/sec² per rad
$M_{\theta_c}$   pitch acceleration derivative due to pitch command input, $\frac{1}{I_y}\frac{\partial M}{\partial \theta_c}$, rad/sec² per rad/sec
$M_{\dot\theta}$   pitch rate damping, $\frac{1}{I_y}\frac{\partial M}{\partial \dot\theta}$, rad/sec² per rad/sec
$p_g$   roll rate equivalent of linearly spanwise distributed vertical gust velocity component, rad/sec, deg/sec
$q$   pitch rate, rad/sec, deg/sec
$s$   Laplace transform variable
$S$   wing reference area, m², ft²
$T$   thrust, N, lb
$u$   airspeed perturbation, m/sec, ft/sec
$u_g$   fore and aft gust velocity component, m/sec, ft/sec
$v_g$   side gust component, m/sec, ft/sec
$V_0$   trim airspeed, knots, m/sec, ft/sec
$w$   vertical speed perturbation, m/sec, ft/sec
$w_g$   vertical gust velocity component, m/sec, ft/sec
$X_u$   longitudinal acceleration derivative due to forward speed, $\frac{1}{m}\frac{\partial X}{\partial u}$, 1/sec
$X_{u_c}$   longitudinal acceleration derivative due to forward speed command, $\frac{1}{m}\frac{\partial X}{\partial u_c}$, 1/sec
$X_\alpha$   longitudinal acceleration derivative due to angle of attack, $\frac{1}{m}\frac{\partial X}{\partial \alpha}$, m/sec² per rad, ft/sec² per rad
$X_\gamma$   longitudinal acceleration derivative due to flight path angle, $\frac{1}{m}\frac{\partial X}{\partial \gamma}$, m/sec² per rad, ft/sec² per rad
$X_{\gamma_c}$   longitudinal acceleration derivative due to flight path command input, $\frac{1}{m}\frac{\partial X}{\partial \gamma_c}$, m/sec² per rad, ft/sec² per rad
$X_\theta$   longitudinal acceleration derivative due to pitch attitude, $\frac{1}{m}\frac{\partial X}{\partial \theta}$, m/sec² per rad, ft/sec² per rad
$X_{\theta_c}$   longitudinal acceleration derivative due to pitch attitude command input, $\frac{1}{m}\frac{\partial X}{\partial \theta_c}$, m/sec² per rad, ft/sec² per rad
$Z_u$   vertical acceleration derivative due to forward speed, $\frac{1}{m}\frac{\partial Z}{\partial u}$, 1/sec
$Z_{u_c}$   vertical acceleration derivative due to forward speed command input, $\frac{1}{m}\frac{\partial Z}{\partial u_c}$, 1/sec
$Z_w$   ($\equiv Z_\alpha/V_0$), 1/sec
$Z_\alpha$   $\frac{1}{m}\frac{\partial Z}{\partial \alpha}$, m/sec² per rad, ft/sec² per rad
$Z_\gamma$   vertical acceleration derivative due to flight path angle, $\frac{1}{m}\frac{\partial Z}{\partial \gamma}$, m/sec² per rad, ft/sec² per rad
$Z_{\gamma_c}$   vertical acceleration derivative due to flight path angle command input, $\frac{1}{m}\frac{\partial Z}{\partial \gamma_c}$, m/sec² per rad, ft/sec² per rad
$Z_\theta$   vertical acceleration derivative due to pitch attitude, $\frac{1}{m}\frac{\partial Z}{\partial \theta}$, m/sec² per rad, ft/sec² per rad
$Z_{\theta_c}$   vertical acceleration derivative due to pitch command input, $\frac{1}{m}\frac{\partial Z}{\partial \theta_c}$, m/sec² per rad, ft/sec² per rad
$\alpha$   angle of attack, rad, deg
$\gamma$   flight path angle, rad, deg
$\gamma_c$   flight path angle command, rad, deg
$\Delta(\ )$   perturbation from trim condition
$\delta_c$   fore and aft control column deflection, cm, in.
$\delta_f$   flap deflection, rad, deg
$\delta_{sp}$   spoiler deflection, cm, in.
$\delta_t$   horizontal tail deflection, rad, deg
$\delta_{th}$   throttle deflection, cm, in.
$\theta$   pitch attitude, rad, deg
$\dot\theta$   pitch rate, $\frac{d\theta}{dt}$, rad/sec, deg/sec
$\theta_c$   pitch command, rad, deg

INTRODUCTION 


Piloting  Problems  of  STOL  Airplanes 

The approach and landing is probably the most demanding task performed
by the pilot of a transport airplane. It is even more so with a
STOL aircraft that is landed out of a steeper glide slope with a higher
accuracy requirement on touchdown dispersion. The problem is further
complicated in the case of powered lift machines by their poor handling
qualities in approach and landing. A stability augmentation system (SAS) is
usually necessary to provide acceptable flying qualities. However, some
deficiencies, mainly in the longitudinal plane, exist even with the stability
augmentation. The most important problem is a poor flight path angle
response to attitude changes, which makes it difficult to use pitch attitude for
precision control of the flight path. In the powered lift airplane a sizable
flight path angle response is associated with the throttle, but if jet engines
are the power plants, there is a time lag between the throttle movement and
the change in engine thrust. This time lag creates problems in attempting
to use the throttle as a precision flight path control. The inadequacy of
either the control column or the throttle alone in the control of flight path
angle necessitates the use of both. Typically all three flight variables
(flight path angle, forward speed, and pitch attitude) change in response to
control column displacement, and flight path and speed respond to throttle
movement. Consequently, the pilot has to coordinate action on both levers
in order to control the three coupled variables. All this results in
deficient flying qualities. Adverse ground effect and turbulence cause a
substantial increase in pilot workload.

The above observations are substantiated by Reference 1, which describes
a fixed base simulator study conducted by NASA Langley Research
Center (LRC). The purpose of this study was to determine the flight
characteristics during the approach and landing of a representative STOL
transport airplane having a high wing and equipped with an
external-flow jet flap in combination with four high-bypass-ratio fan-jet
engines (see Figure 1). Conventional stability augmentation systems (SAS)
have been applied to obtain satisfactory handling qualities.

The  Concept  of  Decoupling 

Decoupled longitudinal controls have been suggested in Reference 2 in
an attempt to improve the handling qualities of the airplane treated in
Reference 1. The essence of this concept is to make each one of the three
flight variables respond only to one cockpit control. In this study, the column
was chosen to affect flight path angle and not to change speed or attitude.
The throttle handle was chosen to control speed without affecting flight path
or attitude, and a pitch-trim thumbwheel controlled attitude without changing
flight path angle or speed. Provided that the quality and authority of
each of the flight variable responses to their appropriate cockpit levers are
adequate, the pilot can use one lever to control one flight variable.

In  landing  such  a  decoupled  STOL  airplane,  speed  and  pitch  attitude 
are  stabilized  early  in  the  approach  and  the  pilot  does  not  have  to  concern 
himself  with  their  active  control  any  more.  He  can  concentrate  on  control¬ 
ling  flight  path  angle  by  using  the  column  only.  With  the  excellent  flight 
path  response  to  column  that  was  provided,  this  resulted  in  a  very  signifi¬ 
cant  improvement  of  flying  qualities  which  is  especially  pronounced  in  the 
presence  of  adverse  ground  effect  and  turbulence. 

The price that has to be paid for this improvement is an unconventional
and complex flight control system that, in the configuration suggested
in Reference 2, employed four feedback variables (u, $\alpha$, $\theta$, q)
and four active control elements: throttle, symmetric spoiler, flap, and
horizontal tail.

Ground Simulator Study of Decoupled Controls

A fixed base ground simulator study of decoupled controls for the
externally blown flap (EBF) STOL in the approach and landing was the subject
of Reference 2. An improvement in flying qualities of the longitudinally
decoupled airplane in comparison to the conventionally augmented one was
reported. Visual cues in the simulator were obtained by a closed circuit
television system. However, the use of this system for image generation
caused difficulties in sensing altitude and altitude rate, resulting in higher
sink rates at touchdown and higher touchdown point dispersion along the
runway than would be expected in real flight.

The same control concept has been tested on a moving base simulator
(Reference 3), which increased the realism of the simulation by providing
motion cues. A closed circuit television generated image of the
same type that had been used on the fixed base simulator was also used on
the moving base simulator. Another deficiency of the moving base simulator
resulted from the wash-out circuits needed to prevent position, velocity,
and acceleration limiting.


In-flight  Simulation  Study  of  Decoupled  Controls 

This paper summarizes an in-flight simulation study of the longitudinal
decoupled controls for the same airplane that was the subject of
References 1, 2, and 3. This study, which was conducted on Princeton's
in-flight simulator, was motivated by the positive results obtained in the
ground simulators. The more realistic environment of the in-flight
simulator, with its real-world visual cues and its full unlimited motion (for the
STOL approach and landing problem), produces an investigation tool that is
superior to ground simulators.

Decoupled lateral controls were also introduced and tested in the
ground simulators; however, the advantages offered over conventional SAS
were not as significant as in the case of longitudinal decoupling; therefore,
the in-flight investigation included only the latter.

A more detailed account of the in-flight study may be found in
Reference 4.

The in-flight simulator is shown in Figure 2. It is a "fly-by-wire"
airplane  with  adjustable  stability  and  control  characteristics.  In  this  test 
program  they  were  adjusted  to  match  the  characteristics  of  an  EBF  STOL 
transport  in  an  approach  and  landing  configuration  having  decoupled  longi¬ 
tudinal  controls  in  some  experiments  and  conventional  SAS  in  others. 


Tested Configurations

Five  control  configurations  were  tested  in  this  program.  Three  of 
them  were  variations  of  decoupled  controls: 

•  Steady  State  Decoupled  (SSD) 

•  Completely  Decoupled  (CD) 

•  "Recoupled"  (REC) 

Two  configurations  were  variations  of  conventional  augmentation: 

• Stability Augmentation System (SAS)

•  Improved  Stability  Augmentation  System  (ISAS) 

Following are the descriptions of those configurations and their
characteristics.

Steady State Decoupled (SSD) Configuration - The decoupled longitudinal
control scheme, described in Reference 2, was the main subject of this
study. In this design, steady state, rather than complete, decoupling was
implemented. The term "steady state decoupling" means that whereas
only one of the three flight variables (airspeed, flight path angle, or pitch
attitude) exhibits a steady state change in response to the appropriate
cockpit control lever, the other two variables may undergo transient variations.

The Navion in-flight simulator control assignments for this configuration
were: column to control flight path angle, with a beep trimmer on the
left horn to trim $\gamma$; throttle to control forward speed; pitch thumbwheel to
control pitch attitude. The controls were mechanized such that the changes
in the flight variables were proportional to control displacements from
their trim positions. A constant ±2°/sec $\gamma$ rate was associated with the
beep trimmer.

The schematic structure of the decoupling control system that was
employed  in  Reference  2  is  shown  in  Figure  3. 


This  system 

The terms in the left hand side (lhs) matrix are made of airframe stability
derivatives,  augmented  by  feedbacks  of  the  F  matrix.  The  right  hand  side 
matrix  is  determined  by  the  G  matrix.  The  airplane  can  be  made  steady 
state  decoupled  by  choosing  G  such  that  the  rhs  of  equations  (1)  equals  the 
lhs  with  s  =  0.  The  lhs  matrix  determines  the  airplane's  response  dynam¬ 
ics;  therefore,  the  F  matrix  may  be  used  to  obtain  desired  dynamics  of  the 
responses  of  the  three  variables  to  their  controls,  as  well  as  to  minimize 
coupling  transients. 
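
The steady state decoupling condition can be illustrated with a small numerical sketch in Python. It assumes, for illustration only, that the equations of motion take the form L(s) x = R c, where x collects the three flight variables, c the three commands, and R is the right hand side matrix fixed by G. The matrices `A0` and `A1` below are made-up placeholders, not Navion or EBF STOL data, and L(s) = A0 + A1 s is a deliberately simplified stand-in for the real left hand side.

```python
def solve(a, b):
    """Solve a @ x = b for the matrix x by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    m = [[complex(v) for v in ra] + [complex(v) for v in rb] for ra, rb in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        p = m[col][col]
        m[col] = [v / p for v in m[col]]
        for r in range(n):
            if r != col:
                f = m[r][col]
                m[r] = [vr - f * vc for vr, vc in zip(m[r], m[col])]
    return [row[n:] for row in m]


# Hypothetical augmented-airframe matrices (placeholders): L(s) = A0 + A1 * s.
A0 = [[0.5, 0.1, 0.0], [0.2, 1.0, 0.3], [0.0, 0.4, 2.0]]
A1 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]


def lhs(s):
    """Left hand side matrix evaluated at the Laplace variable s."""
    return [[A0[i][j] + A1[i][j] * s for j in range(3)] for i in range(3)]


def command_gain(s):
    """Gain from commands to flight variables when the rhs matrix is chosen as L(0)."""
    return solve(lhs(s), lhs(0.0))


steady = command_gain(0.0)     # identity: each command moves only its own variable
transient = command_gain(1j)   # off-diagonal terms are nonzero away from s = 0
```

At s = 0 the gain matrix is the identity, so each command produces a steady state change in its own variable only. Away from s = 0 the off-diagonal entries are nonzero, which corresponds to the coupling transients that the feedback matrix F is used to shape.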


Analog  computer  traces  showing  the  responses  of  the  steady  state  de¬ 
coupled  (SSD)  EBF  STOL  of  Reference  2  are  shown  in  Figure  4.  It  can  be 
seen  that  this  configuration  has  a  minimal  amount  of  transient  coupling. 

The  responses  of  the  three  variables  to  their  controls  are  rapid  and  very 
well  behaved.  This  situation  was  achieved  by  highly  augmenting  all  sta¬ 
bility  derivatives  with  respect  to  the  basic  airframe  values.  High  aug¬ 
mentation  increases  turbulence  sensitivity.  This  will  be  discussed  further 
in  following  sections. 


Completely Decoupled (CD) Configuration - Transient, as well as
steady state decoupling, results from the following set of equations:

In this configuration, feedbacks are used to null off-diagonal terms in the
lhs matrix and, again, a prefilter, G, is used to obtain a diagonal right
hand side matrix.

Unaugmented Navion derivatives were used in the left hand matrix
diagonal with the exception of $M_{\dot\theta}$, which was augmented by a factor
of about two to improve pitch damping.

Analog computer responses to step inputs of this completely decoupled
configuration are shown in Figure 5. As expected, they do not exhibit any
coupling transients; also, all three responses are noticeably slower than
those of the SSD configuration (Figure 4). The responses could have been
made faster by augmenting the Navion derivatives in the diagonal of the left
hand side matrix. However, this was not done as the responses were judged
to be quite adequate and, therefore, there was no reason to bear the penalty
of increased turbulence sensitivity associated with augmented stability
derivatives.


"Recoupled" Controls (REC) - This configuration was identical to the
completely decoupled (CD) one with one difference: pitch attitude response
to control column was restored, such that column displacement caused a
change in pitch attitude that was of equal magnitude and in the same
direction as the flight path angle change produced by the control column in
the completely decoupled configuration. The reason for including this
configuration in the study was that attitude changes might provide the pilot
with a good visual cue to predict variations in flight path angle. It was
felt that the elimination of this cue in the constant attitude landings with
the decoupled configurations might present some piloting problems and that
they might be alleviated by recoupling pitch attitude to flight path angle as
they are coupled in conventional aircraft.

EBF STOL with Conventional Stability Augmentation System (SAS) -
This configuration was the subject STOL aircraft with its conventional SAS
as tested in a previous flight simulation program that has been carried out
at Princeton (Reference 5). The SAS included pitch attitude and pitch rate
feedbacks to the horizontal tail to provide a pitch attitude command/hold
system. Speed feedback to the flap and symmetric spoilers was incorporated
in the system to enhance speed stability.

Improved SAS (ISAS) Configuration - A modified SAS configuration
was evaluated in order to explore the possibility of improving handling
qualities of the SAS airplane with a control system that is not as complex
as the decoupling system. The modifications of the SAS were the following:

• Lift response to angle of attack, Zα, augmentation was incorporated.

• The speed stability parameter, dγ/du, was changed from zero to a
small negative value.

• The thrust response lag was reduced (implying throttle to spoilers or
flaps interconnect).

• The thrust control sensitivity, Zδth, was increased.

Figure 6 shows a comparison of the SAS and ISAS configurations. The
major advantage of the ISAS is an improved flight path response to control
column input, having a faster rise time and settling down at a positive
steady state value rather than going back to zero as in the SAS case. The
Zα augmentation is the main contributor to this improvement. The rise
time of the flight path response to throttle was also shortened in the ISAS,
mainly by the reduction of the thrust lag.

Simulation of Turbulence - Three components of turbulence were
simulated in this program: fore and aft, ug; vertical, wg; and
gradient, qg. Turbulence signals that had been prerecorded on a magnetic
tape were channeled into Navion controls to produce simulated
turbulence-induced accelerations. The magnitudes of the accelerations were
proportional to the appropriate stability derivatives of each configuration.
Augmented, rather than bare airframe, stability derivatives were used in
calculating turbulence sensitivity, as it was assumed that aerodynamic
sensors were used in the feedback loops.
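The proportionality between turbulence-induced acceleration and the stability derivatives can be sketched as follows. The heave derivative, the augmentation factor, and the gust samples are hypothetical values chosen only to show the scaling; they are not taken from the simulation.

```python
import math


def heave_acceleration(z_w, w_gust):
    """Heave acceleration (m/s^2) for each vertical gust sample (m/s)."""
    return [z_w * w for w in w_gust]


def rms(values):
    """Root mean square of a sequence of samples."""
    return math.sqrt(sum(v * v for v in values) / len(values))


# Hypothetical vertical gust samples, m/s.
w_gust = [0.5, -1.2, 0.8, 1.5, -0.3, -0.9]

Z_W_BARE = -0.5            # assumed bare airframe heave derivative, 1/s
AUGMENTATION_FACTOR = 4.0  # assumed augmentation through aerodynamic sensors
z_w_augmented = Z_W_BARE * AUGMENTATION_FACTOR

ratio = (rms(heave_acceleration(z_w_augmented, w_gust))
         / rms(heave_acceleration(Z_W_BARE, w_gust)))
print(f"RMS heave acceleration grows by a factor of {ratio:.1f}")
```

With aerodynamic sensors in the feedback loops, the gust response in this simple model scales directly with the augmented derivative, so a highly augmented configuration is correspondingly more sensitive to the same turbulence record.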

Lateral Directional Dynamics - Lateral directional dynamics that are
typical for the subject airplane were employed in this program with all the
various longitudinal configurations.

Adverse Ground Effect - A lift loss of ΔCL/CL = -0.1 was simulated.
No  moment  or  drag  changes  were  included. 




Test  Procedure  and  Conditions 


The test pattern that was used is shown in Figure 7. The approaches
were either straight-in with 6° or 4° glide slope angles, or segmented with
an initial 9° or 6° angle transitioning into a final 4° segment. The normal
approach speed was 70 kts. The outer part of the approach was simulated
IFR using a TALAR (Tactical Landing Approach Radar) MLS (Microwave
Landing System) unit. At an altitude of 61 m (200 ft) the pilot transitioned
to VFR using optical glide slope light bars for guidance. The task
required touching down within a 61 m (200 ft) long marked area on the runway
with a rate of sink as low as possible.


Evaluation was mainly based on pilot rating using the familiar
Cooper-Harper scale adopted from Reference 6. Landing performance in terms
of touchdown distance and rate of sink was also measured. Most of the
evaluations were flown by two Princeton University pilots.

RESULTS  AND  DISCUSSION 


The  Decoupled  Configurations  Compared  to  the  STOL 
Airplane  With  Conventional  SAS 

General  Aspects  of  Landing  the  Decoupled  Airplanes  -  The  steady 
state  decoupled  (SSD)  STOL  airplane  turned  out  to  be  very  well-behaved. 

The approach-and-landing was performed at a constant pitch attitude, and
speed was constant at 70 kts. In calm air, speed and pitch attitude stayed
at their selected trimmed values without any pilot intervention so that he
had to control only flight path. Flight path response was judged to be very
favorable in terms of quickness and authority.

A  short  period  of  adjustment  was  required  for  one  of  the  pilots  to  ac¬ 
cept  the  visual  aspects  of  the  constant  attitude  flare.  The  lack  of  aircraft 
rotation  made  this  pilot  underestimate  the  amount  of  flight  path  angle  change 
associated  with  column  deflection.  This  caused  him  to  overflare.  An  over¬ 
flare  with  the  decoupled  airplane  resulted  in  a  substantial  touchdown  over¬ 
shoot  since  it  maintained  a  parallel  to  the  runway  or  slightly  ascending 
flight path, and a constant speed, unlike a conventional airplane that would
lose  speed  and  eventually  settle  down  under  similar  circumstances.  The 
decoupled  airplane,  following  an  overflare,  had  to  be  brought  down  by  re¬ 
leasing  pressure  from  the  column.  However,  after  a  short  learning  period, 
this  pilot  was  able  to  acquire  the  proper  technique  that  resulted  in  a  con¬ 
sistently  precise  touchdown  at  the  middle  of  the  prescribed  zone,  with  a 
very low sink rate and with a very small work load. The other Princeton
pilot, having more experience with unconventional configurations, did not
need  even  the  short  adjustment  period.  It  turned  out  that  the  visual  and 
normal  acceleration  cues  that  are  available  in  a  constant  attitude  flare,  are 
quite  adequate,  enabling  the  pilot  to  land  the  airplane  with  remarkable  pre¬ 
cision. 

Both pilots gave this configuration a Cooper rating of 2.0 for a 4°
approach with no turbulence. This applies to the instrument part of the
approach, to the visual part, and to the flare-and-landing.

The  completely  decoupled  (CD)  configuration,  in  calm  air,  received 
ratings  practically  identical  to  those  for  the  steady  state  decoupled  (SSD) 
STOL, as might be predicted considering the similarity of the time
responses of the two configurations.
Some quotations from pilot commentary about the decoupled configurations
under no-turbulence conditions follow:

Pilot A: "This configuration, in the ideal no-wind, no-turbulence
conditions tested, is very good. The airplane is controlled very
easily on the glide slope where speed is maintained automatically
and the pilot has to control γ only."

Pilot  B:  "Airplane  can  be  landed  smoothly  and  accurately;  still, 
it  is  different  from  a  conventional  airplane  in  the  fact  that  a  nor¬ 
mal  airplane  in  the  case  of  an  overflare  will  eventually  lose  some 
speed  and  touchdown.  In  this  airplane  an  overflare  results  in  no 
touchdown  unless  pressure  is  released  from  yoke.” 

A flight path to column control sensitivity of Δγ/Δδc = 1.0 deg/cm
(2.7 deg/inch) was found to be a good value for the approach, and for a
flare out of a 4° approach. A 35% higher value was preferred by the pilots
for flying in simulated turbulence or landing out of a 6° approach.

The capability to trim out forces on the glide slope was considered an
important feature of the decoupled system. A flight path trim rate of
dγ/dt = 2°/sec was found to be convenient. The trimmer was used quite
extensively to make corrections on the approach.

6° Approach - No degradation in pilot rating was associated with the
steeper, 6°, final segment of the approach. The pilots felt that the
responsive control they had over flight path minimized the increase in work
load that might have been associated with a higher rate of sink.
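The higher rate of sink on the steeper glide slopes follows from simple kinematics. The short sketch below assumes still air and treats the 70 kt approach speed as the speed along the flight path; the function name is illustrative.

```python
import math

KT_TO_MPS = 0.514444  # one knot expressed in m/s


def sink_rate_mps(speed_kt, glide_slope_deg):
    """Rate of sink (m/s) on a glide slope, assuming still air."""
    return speed_kt * KT_TO_MPS * math.sin(math.radians(glide_slope_deg))


for angle in (4.0, 6.0, 9.0):
    rate = sink_rate_mps(70.0, angle)
    print(f"{angle:.0f} deg glide slope at 70 kts: {rate:.2f} m/s")
```

Under these assumptions the 6° glide slope gives a rate of sink roughly one and a half times that of the 4° glide slope, which is the increase the pilots had to arrest in the flare.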

Ground Effect - Ground effect variations of ΔCL/CL between -0.1 and -0.20
were  tested  and  found  to  have  no  significant  impact  on  landing  the  decoupled 
configurations.  The  pilots  felt  that  with  the  tight  flight  path  control  that 
could  be  applied,  they  had  no  problem  in  counteracting  the  ground  effect 
"suckdown."  In  contrast  to  these  results,  in  the  case  of  the  STOL  with 
conventional  SAS,  adverse  ground  effect  contributed  to  piloting  difficulties 
and  accentuated  other  airplane  deficiencies,  as  reported  in  Reference  5. 

Figure  8  shows  a  landing  time  history  for  the  steady  state  decoupled 
configuration  for  a  6°  approach  with  no  turbulence.  It  is  very  clear  that  the 
flare  was  performed  by  the  column  only.  The  pilot  did  not  touch  the  throttle 
(or  rather  speed  handle)  throughout  the  flare.  He  made  a  small  B  adjust¬ 
ment  prior  to  flare  initiation.  Speed  and  attitude  are  essentially  constant 
and Δδc, Δγ, and h change smoothly from flare initiation to touchdown at
h = 0.
Segmented Approaches - 6°/4° and 9°/4° approaches have been
performed and have not been found to present any piloting difficulties with
the decoupled configurations. The pilots felt that the task was quite easy
to perform. The flight path trimmer was the main controller used by the
pilots in turning the two segment corner. In calm air almost no control
column motion was used and the corner was turned on trimmer only.

Aborted Approaches - Aborted approaches were tested with the
decoupled configurations on several runs. The go-around technique for these
airplanes was simply to pull on the column in order to command an up
flight path angle. This indeed was very easy to do and presented no
handling problem, and was rated at 2.0; however, the simulator climb
performance was very limited in this maneuver. The Navion in this respect was
probably not an accurate simulation of the STOL; however, the climb
performance of the STOL is also very limited in the landing configuration, and
therefore the simulation was not unrealistic. A rate of climb of 1.52 m/sec
(300 ft/min) could be established, and the pilots considered it acceptable.

STOL with Conventional SAS - The STOL with conventional SAS was
clearly a more difficult machine to control. It was described by the pilots
as sloppy and sluggish and they had to work harder to obtain touchdown
performance similar to that of the decoupled configurations. Landing it
required the coordinated use of both column and throttle because neither
alone provided adequate flight path control. The response of Δγ to column
contained a transient due to the presence of Zw, but no steady state, as
dγ/du was zero. The transient was not big enough to permit using the column
as the sole controller in the flare. The throttle, on the other hand, did
produce steady state changes in γ, but because of the lag that was associated
with it, the pilots could not use it for the rapid fine lift modulations that
were required in the flare.

A landing time history for the SAS configuration at a 4° approach with
no turbulence but with the nominal ground effect, ΔCL/CL = -0.1, is
shown in Figure 9. The extensive use of the throttle is in obvious contrast
to the situation in the decoupled airplane. A throttle advance about 8
seconds prior to touchdown is employed in order to obtain the desired steady
state change in γ.

Figure 10 provides a comparison of pilot ratings assigned to the
completely decoupled (CD) configuration and to the SAS airplane. (CD and SSD
have the same ratings in the absence of turbulence.) With no turbulence
the SAS is seen to be rated 3.0 for the approach, which is one unit worse
than the CD configuration. For the landing, the SAS is rated 4.0, or two
units worse than the decoupled airplane.
The improved SAS (ISAS) configuration is seen in Figure 10 to be
rated significantly better than the airplane with the conventional SAS. With
no turbulence it is rated only 0.5 units below the CD configuration for the
approach and 0.75 units for the landing. The ISAS was found to be better
than the SAS because of its improved flight path response to column and to
throttle. However, the pilots still elected to use both hands in
coordination while flying this airplane. They could rely more on the control
column and less on the throttle in flaring the airplane, but they still did
not have sufficient confidence to use a single control. This explains the
slightly worse rating of the ISAS with respect to the decoupled configuration.

The  Effects  of  Turbulence  -  Turbulence  was  simulated  in  this  pro¬ 
gram  as  described  in  the  Tested  Configurations  section.  In  the  steady 
state decoupled configuration (SSD), as was indicated before, highly
augmented stability derivatives were employed in order to achieve fast
responses to pilot controls, and minimal coupling transients. Since it was
assumed that aerodynamic, rather than inertial, sensors were used for
augmentation,  simulated  turbulence  disturbances  were  proportional  to  the 
augmented  stability  derivatives,  rather  than  to  those  of  the  bare  airframe. 
This  resulted  in  an  excessive  sensitivity  to  turbulence  of  the  SSD  configura¬ 
tion  that  was  beyond  the  control  capability  of  the  simulator.  Therefore, 
this  configuration  had  to  be  tested  at  a  lower  level  of  simulated  turbulence 
(about  one-fourth)  than  the  other  configurations.  Using  the  same  ground 
rules,  the  CD  configuration  was  much  less  sensitive  as  its  off  diagonal 
stability  derivatives  were  nulled  (through  the  use  of  feedbacks),  and  all  of 
its  derivatives  were  lower  than  those  of  the  SSD.  This  made  it  possible  to 
test the CD configuration with the same amount of turbulence in which the
conventionally augmented (SAS) STOL was tested; therefore, the CD,
rather than SSD, was used in comparing the effects of turbulence on the
decoupled versus SAS configurations.

The  main  problem  incurred  by  turbulence  with  all  tested  configura¬ 
tions,  was  heave  upsets.  Increased  pilot  effort  was  required  to  counter 
those  upsets,  depending  on  their  magnitude  and  on  the  effectiveness  of  the 
available  flight  path  control.  Speed  disturbances  were  also  caused  by  tur¬ 
bulence;  however,  since  they  were  mostly  of  low  magnitude  and  frequency 
higher  than  available  with  speed  control,  the  pilots  chose  to  accept  those 
disturbances  rather  than  try  to  fight  them. 

Figure  11  provides  a  comparison  of  landing  the  decoupled  configura¬ 
tions  with  and  without  turbulence.  The  turbulence  related  heave  motions 
can be seen on the Δγ trace, and the increased pilot activity is very
obvious in the control column trace.
The effect of turbulence on pilot ratings of the various configurations
can be seen in Figure 10. The CD in landing is seen to be degraded by
turbulence from a rating of 2.0 to 3.0, which is still a good position in the
Cooper scale. The SAS is degraded from 4.0 to 5.5, which is not too good,
as a rating higher than 6.5 puts an airplane under the "major deficiencies"
label. The ISAS in landing with turbulence is rated at 4.0, which is not
quite as good as the decoupled airplane, but it is significantly better than
the SAS.
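The ratings quoted in the text for Figure 10 can be collected in a small table to make the turbulence degradation explicit. The sketch below uses only values stated in this section; the no-turbulence ISAS ratings are derived from the stated offsets of 0.5 and 0.75 units relative to the CD configuration, and the variable names are illustrative.

```python
# Cooper ratings quoted in the text (a higher number is a worse rating).
ratings = {
    "CD":   {"approach": 2.0, "landing": 2.0, "landing_turbulence": 3.0},
    "ISAS": {"approach": 2.5, "landing": 2.75, "landing_turbulence": 4.0},
    "SAS":  {"approach": 3.0, "landing": 4.0, "landing_turbulence": 5.5},
}

# Degradation of the landing rating caused by turbulence.
degradation = {
    name: r["landing_turbulence"] - r["landing"] for name, r in ratings.items()
}

for name, r in ratings.items():
    print(f"{name:5s} landing {r['landing']:.2f} -> "
          f"{r['landing_turbulence']:.2f} in turbulence "
          f"(+{degradation[name]:.2f})")
```

Read this way, the CD configuration both starts from the best landing rating and loses the least in turbulence, while the SAS airplane loses the most.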


It  should  be  emphasized  that  a  high  sensitivity  to  turbulence  is  not  in¬ 
herent  in  the  steady  state  decoupling  concept.  The  particular  configuration 
that was tested in this study turned out to be that way since, as discussed
previously, the best possible transient response was sought in the design,
whereas turbulence sensitivity was not taken into account. Turbulence
sensitivity is an important factor affecting aircraft handling qualities and
it should be taken into account in the design of an augmentation system.
Turbulence sensitivity may be reduced by:

• Avoiding highly augmented stability derivatives if aerodynamic
sensors are to be used.

• Using inertial sensors.

• Filtering.


Indeed, the turbulence sensitivity of the SSD configuration has been reduced
in  the  moving  base  simulator  study  of  Reference  3. 


Landing  Performance 

Figure  12  compares  the  landing  performance  obtained  with  the  de¬ 
coupled  airplane  to  the  performance  of  the  conventionally  augmented  STOL. 
The  data  used  in  this  figure  included  runs  with  turbulence  and  without  it. 

The  median  touchdown  point  of  both  configurations  is  seen  to  be  very  close 
to  the  center  of  the  touchdown  zone  (0  in  the  figure).  The  spread  is  very 
small for the decoupled airplane and somewhat bigger for the SAS. The
median rate of sink of the decoupled control scheme is a very good deal less
than 0.3 m/sec (1 fps), and its spread is very small. The median rate of
sink of the SAS is significantly higher, at 0.6 m/sec (2 fps), but still
acceptable. The SAS rate of sink spread is markedly higher than the results
with  the  decoupled  configuration. 

Figure 13 shows a comparison of results obtained with the decoupled
airplanes in the in-flight simulation versus the ground simulator studies of
References 2 and 3. It is very obvious that the in-flight results are much
better than those of both fixed base and moving base ground simulators.
Going from fixed base to moving base ground simulator seems to improve
noticeably the rate of sink performance, but hardly affects touchdown
distance results.


The  Recoupled  Configuration 

The  recoupled  configuration  was  derived  from  the  completely  de¬ 
coupled  one  by  restoring  pitch  attitude  response  to  control  column,  equal  to 
the  flight  path  angle  change,  as  described  in  the  Tested  Configurations  sec¬ 
tion.  This  was  done  in  an  attempt  to  evaluate  the  role  of  pitch  attitude  as  a 
cue  that  might  help  the  pilot  in  judging  flight  path  changes.  This  configura¬ 
tion  was  introduced  into  the  program  after  the  pilots  had  flown  the  decoupled 
airplanes  extensively.  The  surprising  result  was  that  the  pilots  found  the 
recoupled  control  scheme  to  be  somewhat  less  desirable  than  the  decoupled 
one.  The  first  reaction  of  both  pilots  to  landing  the  recoupled  airplane  was 
to rate it by 0.5 unit worse than the decoupled case. After some more
experience had been gained, one of the pilots thought that the recoupled
configuration might be equivalent to the decoupled one, whereas the other pilot
retained  his  opinion  that  recoupling  caused  some  degradation  of  flying  quali¬ 
ties. 


Pilot commentary: "No advantage over the decoupled airplane. ILS
tracking: attitude coupling neither helps nor hurts. Visual tracking: don't
get much help out of attitude change. Get information from [the glide slope]
lights. There is a problem for close-in down-gamma correction; I don't
like the nose going down in response to a down-gamma command. Flare
and  touchdown:  very  much  like  a  normal  airplane.  Nose  up  and  down  mo¬ 
tion  interferes  with  good  prediction  of  the  touchdown  point.  The  ability  to 
judge  sink  rate  close-in  is  not  improved  by  the  theta  coupling." 

To  conclude,  the  results  of  the  tests  indicate  that  coupling  attitude  to 
flight  path  does  not  provide  any  advantage  over  the  decoupled  situation. 

This  suggests  that  the  perceptions  obtained  by  the  pilot  from  the  visual  and 
motion  cues  that  are  available  in  a  constant  attitude  landing,  are  clearly 
adequate  for  the  task  and  no  additional  improvement  is  obtained  by  using 
pitch  attitude  changes. 


Deceleration  Prior  to  Touchdown 

It was felt that it might be advantageous for the powered lift STOL
airplane to approach at a speed that is somewhat higher than the touchdown
speed; this might improve handling qualities, and performance for a
possible aborted approach. This, of course, necessitates a speed reduction
prior to touchdown. Runs in which the pilots were requested to fly the
approach at 75 kts and touch down at 70 kts were incorporated in the flight
program in order to evaluate the effect of the deceleration maneuver on
piloting with the various configurations. Speed reduction was performed
with the decoupled configurations by simply pulling back on the speed
command lever (throttle). In the ISAS configuration, a simulated autopilot
speed command dial was employed, and in the SAS, coordinated control column
and throttle movements had to be used in slowing the airplane down. The
pilots were requested to reduce speed below breakthrough altitude but no
precise point was specified. It turned out that an increment in workload
was caused by the deceleration in all cases. The amount of this increment
depended on how well the approach was going prior to the deceleration, and
on the configuration. With the decoupled configurations deceleration was
easy and could be done in coincidence with the flare. The ISAS and SAS
configurations required an ever increasing effort. The increased difficulty
in performing the deceleration was offset by the pilots' choice to slow the
harder configurations down at a higher altitude.

Further  details  on  this  subject  are  given  in  Reference  4. 


CONCLUSIONS 


Three variations of decoupled longitudinal control configurations and
two variations of conventionally augmented longitudinal control
configurations for an EBF STOL airplane in the approach and landing flight
phases were studied through in-flight simulation. The conclusions of this
study follow:

1. The decoupled longitudinal controls that were tested produced very
favorable flying qualities in the approach and landing. This resulted
in small touchdown dispersions along with consistently low sink rates.

2. Adverse ground effect did not cause any piloting problems with the
decoupled airplane as it could be easily countered by control actions.

3. Segmented approaches, in which an initial 9° glide slope segment was
followed by a shallower 4° or 6° final segment, were easy to perform
with the decoupled airplane. Turning the corner between segments
was found to be an easy task. No significant degradation was
associated with using a 6° rather than a 4° final glide slope.
4. No significant differences in flying qualities were found between the
steady  state  and  completely  decoupled  control  concepts.  This  con¬ 
clusion  would  seemingly  apply  for  any  steady  state  decoupling  scheme 
in  which  transients  were  kept  sufficiently  small  and  of  short  duration. 

5.  The  extreme  turbulence  sensitivity  of  the  steady  state  decoupled  con¬ 
trol  scheme  that  was  tested  in  this  study,  resulted  from  the  use  of 
high feedback gains along with aerodynamic sensors as sources of
feedback  signals.  High  sensitivity  to  turbulence  is  not  inherent  in  the 
steady  state  decoupling  concept.  It  may  be  reduced  by  either  using 
inertial  sensors,  or  by  avoidance  of  high  gains  in  feedback  paths. 
Sensitivity  to  turbulence  must  be  considered  in  the  design  of  such  a 
control  system. 

6.  The  completely  decoupled  configuration  which,  under  the  assumptions 
adopted  in  this  study,  had  a  low  sensitivity  to  turbulence  produced  in 
simulated  turbulence  a  very  significant  improvement  in  flying  qualities 
with respect to the conventional SAS that was tested.

7.  The  recoupled  configuration  in  which  pitch  attitude  changed  along  with 
flight  path,  but  which  was  otherwise  decoupled,  did  not  result  in  any 
improvement  of  flying  qualities  with  respect  to  the  decoupled  configu¬ 
rations.  This  suggests  that  the  perceptions  obtained  by  the  pilot  from 
the  visual  and  motion  cues  that  are  available  in  a  constant  attitude 
landing,  are  clearly  adequate  for  the  task  and  no  additional  improve¬ 
ment  is  obtained  by  using  pitch  attitude  changes. 

8.  The  conventionally  augmented  airplane  that  was  tested  had  signifi¬ 
cantly  poorer  handling  qualities  than  the  decoupled  configurations, 
especially  for  landing  in  turbulence  and  with  adverse  ground  effect. 
The  piloting  problems  of  this  airplane  resulted  mainly  from  its  low 
lift  response  to  angle  of  attack,  the  lag  associated  with  the  throttle 
response,  and  the  resulting  necessity  to  use  coordinated  control  col¬ 
umn  and  throttle  action  in  controlling  flight  path. 

9. Modifying the conventional SAS by increasing lift response to angle of
attack, Zα, changing dγ/du from zero to a small negative value,
increasing thrust control sensitivity, Zδth, and reducing thrust response
lag,  yielded  a  significant  improvement  in  pilot  rating,  but  the  overall 
level  of  handling  was  judged  to  be  not  as  good  as  for  the  decoupled 
configurations. 
Figure 1. Three-View Drawing of Simulated Airplane (All Linear
Dimensions are in Meters (ft))

Figure 9. Landing Time History of the SAS Configuration. 4° Approach;
Adverse Ground Effect; No Turbulence.

Figure 10. Pilot Ratings for the Approach and Landing of the CD, ISAS, and
SAS Configurations

Figure 11. Superimposed Landing Time Histories of CD Configuration
with Turbulence, and the SSD with No Turbulence

Figure 12. Landing Performance Results

Figure 13. Landing Performance of the Decoupled Airplane: Flight vs
Ground Simulators


HIGH  ACCELERATION  COCKPIT  DISPLAYS  AND  CONTROLS 


By  James  A.  Townsend  and  Norbert  J.  Kropewnicki 
Air  Force  Flight  Dynamics  Laboratory 


SUMMARY 


Modern fighter maneuverability has outstripped the pilot's physical
capability to perform in air-to-air combat. The objective of the Air
Force High Acceleration Cockpit program is to provide pilots with
increased physiological protection during sustained and repeated high
G air-to-air combat maneuvers. A major breakthrough in useful combat
maneuverability can be achieved by providing the pilot with an advanced
cockpit design incorporating an articulating pilot seat which allows the
pilot to reposition himself for high energy, high G maneuvering.
Controls, displays, ejection seat, restraining system, seat supinating,
reclining actuation system, and other man-machine interfaces are
included in the cockpit design.


This paper describes the High Acceleration Cockpit (HAC) Program in
general with emphasis on the fighter cockpit design relationships,
including side-stick and throttle characteristics and the design of
displays to retain total mission capabilities of the aircraft. Results
of pilot evaluations of complete mockups and highlights of other work
to date are presented.
INTRODUCTION


A new concept in the field of fighter aircraft crew station design
and control display technology has emerged in the form of the High
Acceleration Cockpit, commonly referred to as the HAC cockpit. The
concept has been developed since 1970 and is just now on the threshold
of a full scale evaluation in a TF-15 fighter aircraft. The latest
complete step to date in the development of this concept is the HAC
Applications Study - the basis of this paper.


BACKGROUND 


The beginning of the development activity in HAC was founded on the
growing appreciation of the fact that new fighter designs were outgrowing
the human pilot's capabilities in sustained high-G (3 to [illegible])
turns repeated several times in an air combat engagement. Not only were the
pilot's maximum G tolerances going to be good over too short a time span
compared to the airplane's continuous high G maneuvering capability, but
the fatigue effect of repetitive exposures would dramatically reduce
post-G tracking accuracy and ability to fly several combat missions a
day in a wartime scenario.

The HAC concept, that of repositioning the pilot transverse to the
load vector, is intended to significantly increase his G tolerance, reduce
his workload and fatigue, and improve his cognitive capability.

Figure 1 illustrates the effect of HAC seat back angle on increasing
the usable ACM envelope.

Figure  2  illustrates  the  principle  of  the  HAC  seating  concept,  which 
provides  a  reduction  in  the  height  of  the  hydrostatic  column  between  eye 
and  heart  by  allowing  the  G  forces  to  act  transversely.  Blood  pressure 
at eye level can be maintained and blood pooling in the lower extremities
can  be  reduced  with  a  consequent  lowering  of  heart  rate. 
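
The hydrostatic-column argument can be illustrated with a short calculation. The sketch below is not taken from the HAC program: it assumes a simple cosine model in which only the component of the eye-to-heart column lying along the load vector contributes to the pressure drop, and it uses assumed values for blood density and eye-to-heart distance. The seat back angle is taken here as the angle between the column and the load vector, and the angles printed are illustrative only.

```python
import math

RHO_BLOOD = 1060.0      # kg/m^3, assumed typical blood density
G0 = 9.80665            # m/s^2, standard gravity
EYE_HEART_M = 0.30      # m, assumed eye-to-heart distance for a seated pilot
PA_PER_MMHG = 133.322   # pascals per millimetre of mercury


def eye_level_pressure_drop_mmhg(load_g: float, seat_back_deg: float) -> float:
    """Hydrostatic pressure drop from heart to eye, in mmHg.

    Only the part of the eye-heart column that lies along the load
    vector contributes, so the effective height scales with cos(angle).
    """
    effective_height = EYE_HEART_M * math.cos(math.radians(seat_back_deg))
    return RHO_BLOOD * G0 * load_g * effective_height / PA_PER_MMHG


if __name__ == "__main__":
    for angle in (13, 45, 65):  # illustrative seat back angles, in degrees
        drop = eye_level_pressure_drop_mmhg(load_g=7.0, seat_back_deg=angle)
        print(f"seat back {angle:2d} deg, 7 G: {drop:6.1f} mmHg drop")
```

Under these assumptions the drop at a given load factor falls in proportion to the cosine of the seat back angle, which is the qualitative effect the seating concept relies on; the model ignores cardiovascular reflexes, anti-G suits and straining maneuvers.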


Figure  1 

Usable  ACM  Envelope  Expansion 

For  Continuous  Sustained  Maneuvering 
on  a  Routine  Basis 
Centrifuge tests conducted at Wright-Patterson AFB (AMRL) and
Brooks AFB (AMD/SAM) (Ref. 1) have demonstrated that the HAC concept
provides significant improvement in pilot cognitive functioning and
physiological condition in the 3 to 10 G range when the seat back angle
is rotated above 45 degrees and up to 65 degrees. A 2 to 3 G improvement
in normal G tolerance has been shown with substantial increase in
tracking performance and comfort at all G levels during combat-type
maneuvers.


Background studies in the last two years have also included two
manned air combat simulations at McDonnell Aircraft Company's dome
simulator. Manned simulation involved subjects from Tactical Air Command.
Results indicated a distinct and consistent combat advantage for the HAC
equipped "aircraft" and a reduction in overall high G exposure because of
the early advantage attained over the non-HAC equipped adversary (Ref. 2).


Indications from the latest of these simulations based on
encounters comparing a HAC-equipped 10.5 G fighter with a non-HAC equipped
7.5 G fighter of otherwise similar design against a common threat
aircraft include:


a.  20%  more  first  offensive  opportunities  for  HAC  A/C  (Engagements 
converted  to  first  kill). 

b.  20%  reduction  in  threat  a/c  first  conversion  capability. 


c.  50%  increase  in  cumulative  missile  hits  in  the  first  40 
seconds  of  the  encounter. 

d.  200%  increase  in  cumulative  bullet  hits  in  the  first  40 
seconds  of  encounter. 

e.  30%  increase  in  kill  ratio  in  the  first  40  seconds  of 
encounter. 

f.  75%  reduction  in  load  limit  exposure  of  HAC  fighter  versus 
non-HAC  fighter. 

g. Average G level reduced from 3.5 G for non-HAC a/c to 3.3 G
for HAC a/c.

h. Improved tracking

i. Improved vision

j. Reduced pilot workload - physical and cockpit operations.
[Figure annotations: Reduced Blood Pooling; Lower Heart Rate;
> 2G Increase for 65° Position]

Figure 2

THE  APPLICATIONS  MOCK-UP  STUDY 

Our  mock-up  or  design  aid  study  consisted  of  the  following  major 
elements  (Ref.  3): 

a.  Construction  of  a  full  scale  cockpit  mock-up  of  the  single  seat 
version  of  the  F-15. 

b. Modification of a conventional ejection seat to include the HAC
reclining/lifting capability (positioning system).

c. The effect of the pilot positioning system on the pilot's ability
to accomplish his mission tasks.

d. Flight controller (side arm) and throttle integration and
evaluation for the range of seat positions.

e.  Evaluation  of  control/display  concepts  designed  to  provide 
accessibility  and  visibility  from  all  seat  positions. 

f.  Test  Plan  preparation  and  static  evaluation  of  candidate  cockpit 
configurations  by  six  operational  fighter  pilots. 

Figure  3  depicts  the  HAC  concept  in  the  full  scale  mock-up 
design  aid.  The  standard  ejection  seat  and  launch  rails  are  in  the 
normal  upright  position.  The  HAC  articulating  seat  liner  can  be  raised 
to  the  reclined  and  lifted  position  at  the  pilot's  option  when  he  needs 
load  factor  protection.  The  cockpit  was  designed  so  that  while  flying 
hard maneuvers he retains outside-the-cockpit vision, can see necessary
inside-the-cockpit displays, and can reach the required controls and
switches. He is provided with a side arm controller and low-throw rudder
pedals, both of which are manually adjustable for the range of 5th to
95th percentile pilots. Partial automatic adjustment with seat
articulation is also provided.

EVALUATION  OF  THE  DESIGN  AID 


The  pilot  evaluators  were  subjected  to  a  series  of  tests  including 
anthropometric  measures,  vision  envelopes,  task  performance  times  and 
eye  and  head  movements. 

Pilot  tasks  during  each  simulated  mission  phase  received  subjective 
comments.  The  crew  also  ranked  individual  cockpit  items,  priority  of  use, 
and  design  location  by  a  "Paired  Comparison"  technique.  Pilot  comments 
were  obtained  with  regard  to:  control  and  display  design,  grouping  and 
location,  seat  adjustment,  back  angle,  body  support,  restraint,  escape 
system, and external and internal vision. Two over-all display and
control arrangements were presented and evaluated: Configuration A,
Figure 4, and Configuration B, Figure 5. The first represented
off-the-shelf components suitable for use in a near term flight test
evaluation, and the second a concept depicting an advanced HAC cockpit
using technology compatible with the 1980/85 time frame. This latter
version, Configuration B, maintained a control/display capability
equivalent to the baseline F-15 aircraft.

Considerable  effort  was  expended  and  several  iterations  were  made  in 
finalizing  the  design  and  conducting  the  evaluation  of  the  primary 
flight  and  propulsion  controls,  side  arm  and  throttle  (Figure  6).  A 
previous  study  had  also  investigated  this  area  and  evaluated  other 
controller locations (Ref. 4).

The  pilot  subjects  felt  that  the  concept  was  good  and  would  provide 
a  very  real  advantage  in  the  combat  area.  The  HAC  seat  with  minor 
improvements  was  considered  as  satisfactory  for  follow-on  testing  in  an 
in-flight program. Pilot tasks, visibility and reach, and controls and
displays were judged acceptable for an actual aircraft installation and
evaluation. Actual production application, if decided upon, would require
Configuration B, as the near term Configuration A was considered
compromised for full mission compatibility.

Areas of concern were mainly those which require in-flight evaluation.
Control capability with the sidearm controller will require close
attention, since precise tracking is required in the fighter mission.
High fidelity simulation will be used for this component prior to flight
tests. Further effort is also required in design and ground tests to
obtain a flightworthy ejection seat design with its articulating mechanism
prior to flight.
[Figure labels: Side Arm Controller; Throttle]

Figure 6
A basic issue emerging from the evaluation was the requirement
to develop multi-sensor displays for integration into the cockpit if the
HAC concept is ever to become a production system. The need for these
displays is a result of the reduced main instrument panel area due to
the interference of the pilot's legs in the reclined position. Several
programs, independent of HAC, dealing with such displays are currently
in the development phase at Wright-Patterson AF Base. An example is the
Multimode Matrix Display (MMM), which is a short depth, light weight,
low power level device using Light Emitting Diodes (LED) to display
primarily alpha-numerics.


FUTURE  PLANS 


The next plateau in the overall High Acceleration Cockpit program is
the design, fabrication, and flight test of the cockpit in a TF-15
aircraft. This phase is about to commence in the form of an Advanced
Development Program (ADP). The flight test will include basic tracking
maneuvers at various G levels and seat back angles, simulated combat
with an adversary aircraft, and demonstration flights for interested
service and industry personnel. The ADP contract, with McDonnell
Aircraft Company, will require 30 months. Flight tests will be run during
the last half of 1978.


REFERENCES 


1.  Burns,  J.  W.:  A  Re-evaluation  of  a  Tilt-Backseat  as  a  Means  of 
Increasing  Acceleration  Tolerance.  USAF  SAM/VNB,  1974,  Brooks  AFB,  Texas. 

2. Sinnett, J. M., Prouhet, E. P., et al: High Acceleration Cockpit
Simulator Evaluation. AMRL-TR-75-123, 1976, in draft.

3. Mattes, R. E., et al: High Acceleration Cockpit Applications Study.
AFFDL-TR-75-139, 1976, Volumes I to IV.

4. Mattes, R. E., Asiala, C. F.: High Acceleration Cockpit Controller
Locations. AFFDL-TR-74-48, 1974, Volume I - Program Summary.




SESSION  IV 


NONVISUAL  DISPLAYS 


Chairman:  W.  H.  Levison 




DESIGN  OF  AN  ENGLISH  LANGUAGE  DATA 

BASE  QUESTION-ANSWERING  SYSTEM 

By  David  L.  Waltz 

University  of  Illinois 
Coordinated  Science  Laboratory  and 
Department  of  Electrical  Engineering 
Urbana,  Illinois  61801 


SUMMARY 


A question-answering system, PLANES, has been written to allow a casual
user to obtain information from a large data base by typing natural English
questions to the system. This paper discusses design goals, gives examples
of PLANES' operation, and discusses helpful factors and design difficulties
in the PLANES world.


INTRODUCTION 


A prime obstacle for non-technical people who wish to use computers
has been the need to either learn a special language for communicating with
the machine or communicate via an intermediary. We feel that the time is
ripe for computers to be equipped for natural language systems which can
be used by persons who are not trained in any special computer language.
In order for such systems to be of value to a casual user, the systems must
tolerate simple errors, must embody a degree of "common sense," must have
a relatively large and complete vocabulary for the subject matter to be
treated, must accept a wide range of grammatical constructions, and of course
must be capable of providing the information and computations requested
by the user.

We are developing such a system called PLANES (for Programmed
LANguage-Based Enquiry System) at the University of Illinois Coordinated
Science Laboratory (see reference 1). PLANES includes an English language
front end with the ability to understand and explicitly answer a user's
requests and to carry on clarifying dialogues with him, as well as the
ability to answer vague or poorly defined questions. We are also building
a library of associated programs which includes functions for recognizing
patterns within the data base and for alerting a user when certain patterns
of data occur which are of interest to him. This work is being carried out
using a subset of the U.S. Navy 3-M data base of aircraft maintenance and
flight data, although the ideas can be directly applied to other
record-based data bases, both military and non-military.
THE GOAL OF PLANES


Our main goal is to allow a non-programmer to obtain information from
a  large  data  base  with  no  prior  training  or  experience.  A  system  to  realize 
this  goal  (1)  must  be  able  to  understand  to  a  substantial  degree  a  user's 
natural  language  and  (2)  must  be  able  to  help  guide  and  educate  the  user 
to  formulate  requests  in  a  form  that  the  system  can  understand  (see  reference 
2). 


Subgoals  of  PLANES 

We have formulated a number of subgoals which we feel are important
for  realizing  our  main  goal: 

(1)  The  system  must  accept  a  user's  "natural"  English  input,  possibly 
including  complex  syntactic  constructions,  abbreviations,  pronoun  reference, 
and  ellipsis  (i.e.  omission  of  one  or  more  words  that  can  be  obviously 
understood in context).

(2)  The  system  must  provide  explicit  answers  to  questions,  and  not 
merely  retrieve  a  file  which  somewhere  contains  the  answer.  The  system 
should  phrase  its  answer  in  a  clear  manner,  including  units  or  dimensions 
of  numerical  answers  and  a  description  of  the  answer  values.  Whenever 
possible,  graphical  output  is  desirable. 


(3)  The  system  must  be  tolerant  of  minor  errors,  e.g.  spelling  and 
grammatical  errors;  it  should  suggest  corrections  for  user  approval  whenever 
possible,  and  should  in  general  be  able  to  continue  processing  of  the 
corrected request without requiring a complete retyping of the request.

(4)  The  system  should  use  clarifying  dialogues  for  several  purposes: 

(a)  to  feed  back  its  understanding  of  the  user's  request,  so  that 
the  user  can  feel  confident  that  the  system  has  understood  his 
request. 

(b)  to  ask  the  user  pointed  questions  about  portions  of  a  request 
which  it  does  not  understand,  in  the  hope  of  evoking  a  paraphrase 
it  can  understand. 

(c) to add new words, phrases and sentences to the system's
knowledge base.

(d)  to  provide  appropriate  HELP  file  information  in  the  event  of 
user  errors  or  direct  requests  for  help. 

(e) to provide information about the system's capabilities,
abbreviations it knows, general contents of the data base, and other
such information to help orient a new user.
(5) Such a system should be convenient to use:

(a)  it  should  be  interactive  and  on-line, 

(b) it should operate rapidly. One minute seems to us to be a
critical length of time: if one in general gets responses in
less than a minute, one retains a level of involvement and interest
in the interactive process; past a minute, interest begins to wane,
and boredom or impatience sets in.

(c)  the  system  should  require  a  minimum  of  typing;  abbreviations 
and  ellipsis  should  be  routinely  handled. 

(6)  The  system  should  be  relatively  easy  to  extend,  both  within  its 
own  world,  and  to  new  data  bases  and  domains  of  discourse. 

It is unlikely that any design can simultaneously satisfy all these
subgoals in an optimum manner. In fact, in PLANES we have chosen a solution
which is not particularly easy to extend, either within our world or to
another world. In addition, in our current implementation we have not
included mechanisms for satisfying all the clarifying dialogue subgoals.
In particular, the system now fails totally if it does not recognize any
one of the phrases in a sentence. Unlike the problems of extending the
system, this subgoal can be realized within our model relatively easily.
We have achieved considerable success on all other subgoals, though our
current system (as of May 1976) will answer probably no more than 25% of
the questions phrased by a novice user.

EXAMPLES OF PLANES' OPERATION


All  examples  in  this  section  are  taken  from  actual  transcripts  of 
PLANES'  operation.  Figure  1  is  a  complete  transcript  of  the  operation  of 
the current PLANES system on a single question. Note that the paraphrase
and  answer  are  expressed  in  a  form  which  is  not  as  intelligible  as  an 
English  form  would  be.  We  intend  to  soon  augment  the  system  to  provide 
English  paraphrase  and  answer  forms. 

Figure  2  shows  an  extended  session,  with  all  but  the  question  and 
answer  suppressed.  Figure  2  illustrates  the  ability  of  the  system  to  handle 
ellipsis  (i.e.  information  omitted  which  is  to  be  understood  in  context). 

Figure 3 shows PLANES' ability to provide answers in a graphical
form. NORHRS stands for "Not Operationally Ready HouRS"; ACTDATE stands
for "ACTion DATE"; 2001 is the first day of the year 1972.
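
The date example above suggests a compact encoding. The following is a minimal sketch, assuming that a day-level ACTDATE is one year digit followed by a three-digit day of year and that all dates fall in the 1970s; both assumptions are inferred from the single example "2001", and the function name `decode_actdate` is hypothetical. The monthly query in Figure 1 uses 301 for January 1973, which suggests that summary records use a different year-and-month code that this sketch does not cover.

```python
from datetime import date, timedelta


def decode_actdate(code: int, decade_start: int = 1970) -> date:
    """Decode an assumed YDDD action date: one year digit, then day of year."""
    year_digit, day_of_year = divmod(code, 1000)
    if not 0 <= year_digit <= 9:
        raise ValueError(f"year digit out of range in {code}")
    year = decade_start + year_digit
    decoded = date(year, 1, 1) + timedelta(days=day_of_year - 1)
    # Reject day numbers that spill out of the stated year (e.g. 000 or 367).
    if decoded.year != year:
        raise ValueError(f"day of year out of range in {code}")
    return decoded


if __name__ == "__main__":
    print(decode_actdate(2001))  # 1972-01-01
    print(decode_actdate(2012))  # 1972-01-12
```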

Figure  4  shows  a  short  dialogue.  PLANES  first  asks  for  time  period 
information  necessary  for  performing  its  data  base  search.  It  then 
detects  a  misspelling  and  suggests  two  plausible  corrections  to  the  user. 
Once  the  user  selects  the  appropriate  correction  by  typing  "2",  the  system 
is  able  to  continue  processing  without  requiring  further  typing. 
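
The correction step in this dialogue can be sketched as follows. The paper does not describe how PLANES selects its candidate corrections; this sketch substitutes the similarity matching in Python's standard `difflib` module, and the vocabulary list, function names and menu wording are illustrative assumptions.

```python
import difflib

# Illustrative vocabulary: month names only. The real system's word list
# is much larger (about 1100 words, per the text).
MONTHS = [
    "JANUARY", "FEBRUARY", "MARCH", "APRIL", "MAY", "JUNE",
    "JULY", "AUGUST", "SEPTEMBER", "OCTOBER", "NOVEMBER", "DECEMBER",
]


def suggest_corrections(word: str, vocabulary: list[str], limit: int = 2) -> list[str]:
    """Return up to `limit` known words that closely resemble `word`."""
    return difflib.get_close_matches(word.upper(), vocabulary, n=limit, cutoff=0.6)


def correction_menu(word: str, vocabulary: list[str]) -> str:
    """Build a numbered menu in the style of the Figure 4 dialogue."""
    options = suggest_corrections(word, vocabulary) + ["NONE OF THE ABOVE"]
    lines = [
        f"I DON'T KNOW THE MEANING OF {word.upper()}",
        "PERHAPS IT'S MISSPELLED?",
        "PLEASE ENTER A NUMBER CORRESPONDING TO ONE OF THE FOLLOWING",
    ]
    lines += [f"{number}. {option}" for number, option in enumerate(options, start=1)]
    return "\n".join(lines)


if __name__ == "__main__":
    print(correction_menu("Jule", MONTHS))
```

Because the user only types a menu number, the rest of the request can be reprocessed with the corrected word and need not be retyped.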
Please enter your question.

HOW MANY FLIGHTS HOURS DID PLANE 0003 LOG IN JAN 73

parsing.
[cpu time was 1.74 seconds, real time was 5.95 seconds.
I have understood your request as:

(COUNT TIME
       (NET NIL)
       (FLY (PLANE (PRONOUN NIL)
                   (TYPE PLANE)
                   (BUSER 3.)
                   (PLNEG NIL)
                   (PLDAM NIL)
                   (PLMAI NIL))
            NIL)
       (TIME (DATE (MONTH (1. 0. 0.))
                   (DAY NIL) (YEAR 73.)) NIL))

Interpreting.
[cpu time was 0.29 seconds, real time was 3.71 seconds.
I have interpreted your request as follows:

(FIND ALL
      ((V O))
      ((SUM (V TOTHRS)))
      (AND (EQU (V ACTDATE) 301.) (EQU (V BUSER) 3.))
      NIL)

Evaluating.
[cpu time was 6.29 seconds, real time was 16.9 seconds.

((SUM TOTHRS) = 33.)


Figure 1. Complete transcript of PLANES' answer
to a user query
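
The interpreted query in Figure 1 selects the records whose ACTDATE and BUSER fields equal given values and sums the flight hours field over them. A minimal sketch of that evaluation step is given below; the record layout, the helper `sum_field` and the sample rows are hypothetical and serve only to show the filter-then-sum structure, not the actual PLANES evaluator or the 3-M record formats.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FlightRecord:
    actdate: int   # date code as it appears in the query, e.g. 301
    buser: int     # tail number (BUSER)
    tothrs: float  # total flight hours (TOTHRS)


def sum_field(records, field, **equals):
    """Sum `field` over the records whose named attributes equal the given values."""
    return sum(
        getattr(record, field)
        for record in records
        if all(getattr(record, name) == value for name, value in equals.items())
    )


# Made-up rows for illustration; they are not drawn from the 3-M data base.
RECORDS = [
    FlightRecord(actdate=301, buser=3, tothrs=10.0),
    FlightRecord(actdate=301, buser=3, tothrs=5.5),
    FlightRecord(actdate=302, buser=3, tothrs=4.0),
    FlightRecord(actdate=301, buser=7, tothrs=9.0),
]

if __name__ == "__main__":
    # Counterpart of (AND (EQU (V ACTDATE) 301.) (EQU (V BUSER) 3.)) with SUM TOTHRS.
    print(sum_field(RECORDS, "tothrs", actdate=301, buser=3))  # 15.5
```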


>> How many flights did the A7 with tail number 003 make in
January, 1973?
((SUM TOTFLTS) = 17.)

>> How many flights did plane 003 make in Feb. 73?
((SUM TOTFLTS) = 1.)

>> During April?
((SUM TOTFLTS) = 8.)

>> March?
((SUM TOTFLTS) = 13.)

>> All of 1973?
((SUM TOTFLTS) = 39.)


Figure  2.  Partial  transcript  of  dialogue  showing 
PLANES'  handling  of  ellipsis 
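
One simple way to picture the ellipsis handling in Figure 2 is as slot filling: each request is reduced to a set of slots, and a fragmentary follow-up overrides only the slots it mentions. The sketch below assumes this model; the slot names and the function `resolve_ellipsis` are hypothetical, and the paper does not state that PLANES is implemented this way.

```python
def resolve_ellipsis(previous: dict, fragment: dict) -> dict:
    """Complete an elliptical follow-up using the previous request.

    Slots named in `fragment` replace the earlier values (a value of None
    means "no restriction"); every other slot is carried over unchanged.
    """
    return {**previous, **fragment}


if __name__ == "__main__":
    # "How many flights did plane 003 make in Feb. 73?"
    february = {"measure": "TOTFLTS", "plane": 3, "month": 2, "year": 73}

    # "During April?" mentions only a month.
    april = resolve_ellipsis(february, {"month": 4})

    # "All of 1973?" names a year and lifts the month restriction.
    whole_year = resolve_ellipsis(april, {"month": None, "year": 73})

    print(april)
    print(whole_year)
```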


>> plot how much maintenance occurred to plane ? between Jan. 1 and
Jan. 12, '72.


Evaluating . 

[cpu  time  was  29.32  seconds,  real  time  was  38.11  seconds. 
Figure 3. PLANES' graphical presentation
of an answer


>> Which Skyhawks logged less than 10 hours
PLEASE ENTER A SPECIFIC TIME PERIOD:

>> between Jan. 1 and Jule 1 '73.
I DON'T KNOW THE MEANING OF JULE
PERHAPS IT'S MISSPELLED?
PLEASE ENTER A NUMBER CORRESPONDING TO ONE OF THE FOLLOWING
1. JUNE
2. JULY
3. NONE OF THE ABOVE

>> 2.
OK

(Answers question)


Figure 4. Example of PLANES' dialogue
and spelling correction abilities


THE  PLANES  WORLD 


In this section I describe the general environment in which the
language understanding programs operate. This environment includes the data
base, the user, and the user's range of queries.


The  PLANES  Data  Base 

We  have  obtained  a  data  base  from  the  Navy  3-M  Data  Base  for  Aircraft, 
Mechanicsburg,  PA.,  consisting  of  complete  records  of  aircraft  maintenance 
and  flight  information  for  48  A7  and  F4  aircraft,  extending  over  a  period 
of  two  years.  Each  time  a  plane  is  serviced,  a  record  is  made  including 
such  information  as  the  time  and  duration  of  the  maintenance,  who  performed 
it,  what  action  was  taken,  which  parts  were  used,  the  manufacturers  of 
these  parts,  whether  or  not  the  service  was  scheduled  or  unscheduled,  and 
so  on.  Records  on  the  number  of  flights  and  the  number  of  hours  in  the 
air  are  also  kept  for  each  plane.  There  are  roughly  forty  different  record 
formats  which  occur  in  the  data  base,  each  containing  between  ten  and  twenty 
separate  fields,  where  each  field  encodes  information  like  the  date  of 
the action, type of aircraft, serial number of the aircraft, type of
malfunction, component serviced, the work station performing maintenance and
so on.


The  48  aircraft  in  our  data  base  are  divided  into  three  groups: 


(1)  24  planes  which  crashed  or  sustained  major  damage  in  accidents 
involving  mechanical  failures; 

(2)  12  planes  with  bad  maintenance  records;  and 

(3)  12  planes  with  good  maintenance  records. 

"Good"  and  "bad"  records  were  judged  by  comparing  the  ratio  of  the  number 
of  NOR  (Not  Operationally  Ready)  hours  to  the  number  of  flight  hours.  A 
high  ratio  represents  a  bad  record  while  a  low  ratio  corresponds  to  a  good 
record. 
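
The classification rule just described reduces to a single ratio per aircraft. A minimal sketch follows, with made-up hours for three hypothetical planes; the paper gives no threshold separating "good" from "bad" records, so the sketch only ranks the planes by the ratio.

```python
def nor_ratio(nor_hours: float, flight_hours: float) -> float:
    """Ratio of Not Operationally Ready hours to flight hours (lower is better)."""
    if flight_hours <= 0:
        raise ValueError("flight_hours must be positive")
    return nor_hours / flight_hours


# Hypothetical (NOR hours, flight hours) totals; not taken from the data base.
FLEET = {
    "plane A": (120.0, 40.0),
    "plane B": (30.0, 60.0),
    "plane C": (200.0, 50.0),
}

if __name__ == "__main__":
    ranked = sorted(FLEET, key=lambda plane: nor_ratio(*FLEET[plane]))
    for plane in ranked:  # best maintenance record first
        print(f"{plane}: NOR/flight ratio = {nor_ratio(*FLEET[plane]):.2f}")
```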


In  addition,  we  have  summaries  of  maintenance  and  flight  data  for 
all  F4  and  A7  aircraft  for  the  same  two  year  period,  so  that  we  can  have 
some  basis  for  classifying  events  as  "normal"  or  "unusual." 


The PLANES data base contains on the order of 10^[exponent illegible]
bits, and occupies about one third of a DEC RP04 disc pack in compressed
form. This data base, while quite large, represents only a fraction of
the entire 3-M data base, which now contains on the order of 10^11 bits
(10 years'
Please enter your question.

>> What types of aircraft are there.

parsing.

THE FOLLOWING PLANES WILL BE RECOGNIZED BY THE SEMANTIC NET:

(1) A3
(2) A7
(3) F4
(4) F11
(5) CHEROKEE
(6) PHANTOM
(7) SKYHAWK

The above planes may be further specified by giving the
"tail number" (i.e. BUSER, BUNO, Bureau Serial Number, etc.)
along with the name of the series.

[cpu time was 2.59 seconds, real time was 5.75 seconds.


Figure 5. PLANES' ability to answer questions
about its contents.
data on all U.S. Navy aircraft, plus summaries).


Helpful  Factors  in  the  PLANES  World 

A number of factors contribute to making our problem much easier to
solve than the general problem of understanding unconstrained natural
language.

(1) Lack of ambiguity - Relatively few words and virtually no sentences
in the PLANES world are ambiguous. The only ambiguous words we have been
able to find are "wing" (meaning "a squadron" or "part of a plane") and
"flight" (meaning "a flying event" or an adjective, as in "flight computer"
or "flight director"). This means that if PLANES can find any
interpretation at all for a request, it is in all likelihood the correct
interpretation.

(2)  Small  vocabulary  -  Our  current  system  has  about  1100  words.  We 
estimate  that  2000  words  will  cover  90%  or  more  of  all  requests  made  by 
users  with  at  least  a  little  prior  experience  with  PLANES. 

(3) Only two modes - PLANES is always either answering a question
from  the  data  base  or  attempting  to  help  a  user  express  his  request  in  a 
form  PLANES  can  understand.  In  general  the  system  need  not  deal  with 
declarative  sentences. 

(4)  People  do  not  type  complex  sentences  -  The  increasing  likelihood 
of  making  typing  errors  in  lengthy  requests,  the  increasing  likelihood 
that  long  requests  will  baffle  a  program  in  some  aspect,  and  general 
laziness  all  contribute  to  keeping  input  requests  short  and  simple  in 
construction. Reference 3 describes an experiment in which non-programmers
thought that they were communicating with an intelligent program, when in
fact they were interacting with another person who would respond
appropriately to any input. It was found that 10 simple sentence types
covered 78% of all input requests, and that another 10 would handle all
but 10% of the requests.

(5) Less than 100% answer rate is acceptable - We feel that a 90%
answer  rate  without  rephrasing  would  be  adequate  to  keep  a  user's  interest 
and  provide  a  practical  and  useful  system.  It  is  possible  that  even  a 
lower  rate  might  be  acceptable. 

(6)  We  have  a  good  idea  of  what  potential  users  would  like  to  know  - 
The Navy has made a study of all the requests made to the 3-M data base during
a  one  month  period,  and  the  frequency  with  which  various  requests  were 
made.  We  thus  had  a  good  idea  about  where  to  concentrate  our  initial 
efforts,  and  the  order  in  which  to  proceed. 

Not-So-Helpful Factors in the PLANES World

(1) The system must contain a great deal of specialized knowledge -
One of our major realizations has been that a small number of general rules
cannot suffice to "translate" natural English requests into data base queries.
Consider the sentences:

(S1) "Which A7 has the worst maintenance record?"

(S2) "Find any common factors of plane numbers 37 and 78."

Clearly the system must contain special programs to compile a "maintenance
record" and special knowledge to judge its "goodness"; the system must
know that having the same digit as the fourth element of their serial
numbers does not constitute a "common factor", but that similar event
sequences are important.

(2)  Each  request  may  be  expressed  in  a  great  many  different  ways. 
Clearly,  if  users  are  encouraged  to  sit  at  a  console  with  little  or  no 
prior  training  or  instruction,  and  if  the  system  is  expected  to  understand 
enough  of  a  user's  input  to  keep  his  interest  and  perform  useful  actions 
from the beginning, then the system must be able to make some sense of a
large number of types of queries, and a wide range of syntactic
constructions.


CONCLUSION 


This  paper  has  presented  only  a  brief  discussion  of  some  design  issues 
and  examples  of  the  performance  of  a  natural  language  front  end  for  a  large 
data  base.  Many  issues  remain,  both  in  the  man/machine  interface  and  in 
the  underlying  understanding  mechanisms,  but  it  is  clear  that  we  can  now 
think of the design of such systems in engineering terms - such systems
are no longer in the realm of science fiction.

Readers  interested  in  more  details  of  implementation  and  operation 
can  consult  reference  3;  a  technical  report  describing  the  system  much  more 
completely  will  be  available  in  August  1976. 
EFFECTS OF LINGUISTIC REDUNDANCY ON PILOT'S

COMPREHENSION OF SYNTHESIZED SPEECH
Carol  A.  Simpson* 

Ames  Research  Center,  NASA,  Moffett  Field,  California 


SUMMARY 


Previous research indicates that pilots' familiarity with aircraft
phraseology results in high intelligibility for synthesized speech cockpit
messages presented as short sentences - 95 percent articulation (percent
correct) for unknown messages heard once (ref. 1). In the present study, the
contribution of the sentence context to intelligibility was tested using two
levels of linguistic redundancy: (1) two key words in isolation, for example,
"cargo fire," versus (2) the same two key words in a short sentence context,
for example, "You have cargo compartment fire."

In the first phase, airline pilots listened to unfamiliar messages that
differed in linguistic redundancy, and that were presented in a background of
continuous weather broadcast at seven signal-to-noise ratios (S/N) ranging
from -8 dB to +8 dB. Pilot performance was assessed by the accuracy of
comprehension for the two key words in each message and the response times
measured from the end of each message to the pilot's signal of comprehension.
In the second phase, pilots first studied a list of the same messages while
listening to them and were then given recognition tests with and without
competing weather broadcast at S/N of 0 dB.
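
For readers less used to decibel notation, the S/N range quoted above can be converted to linear ratios with the standard definitions. The sketch below shows only the two endpoints and the 0 dB recognition-test condition; the spacing of the seven levels is not given here, so no intermediate values are assumed.

```python
def db_to_power_ratio(db: float) -> float:
    """Signal-to-noise power ratio corresponding to a level in decibels."""
    return 10 ** (db / 10)


def db_to_amplitude_ratio(db: float) -> float:
    """Signal-to-noise amplitude (sound pressure) ratio for a level in decibels."""
    return 10 ** (db / 20)


if __name__ == "__main__":
    for snr_db in (-8, 0, 8):
        print(
            f"S/N {snr_db:+d} dB: power ratio {db_to_power_ratio(snr_db):.2f}, "
            f"amplitude ratio {db_to_amplitude_ratio(snr_db):.2f}"
        )
```

At -8 dB the message carries roughly one sixth of the power of the competing weather broadcast, at +8 dB roughly six times as much, and at 0 dB the two are equal.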

Analysis of the data showed that monosyllabic key words in two-word
format were significantly harder to understand than the same key words heard in
short  sentences.  Key  words  in  sentences  were  approximately  20  percent  more 
intelligible  than  key  words  presented  alone  and  response  times  to  sentences 
were  approximately  1  sec  shorter  than  response  times  to  two-word  messages 
across  all  S/N  conditions.  For  polysyllabic  key  words,  the  differences  in 
intelligibility  and  response  times  for  the  two  levels  of  linguistic  redundancy 
were  smaller  but  in  the  same  direction. 

It was concluded that synthesized speech messages for the cockpit should
be worded in the form of short sentences or phrases, such as "The fuel
pressure is low," instead of in abbreviated key word format, such as "fuel
low." This design principle is especially important when the key words are
monosyllabic.


*National Research Council Associate
INTRODUCTION


Miller, Heise, and Lichten (ref. 2) presented convincing evidence that
words in a sentence context are more intelligible than the same words in
isolation. Their finding applies to speech heard without the benefit of any
particular real-world context. The authors also found that as the number of
possible alternative interpretations decreased, due to the context of the
situation or to knowledge of the message set, message intelligibility
increased. These findings could be extended to predict that listeners
familiar with a particular situation, such as flying an aircraft, and with
a particular type of well-learned phraseology, such as Air Traffic Control
(ATC) phraseology and cockpit terminology, would exhibit, for messages
pertaining to the flight situation, little or no intelligibility differences
between words presented in isolation and words presented in sentences. It has
been found in a previous study (ref. 1) that pilots do indeed have an
advantage over nonpilots in understanding flight phraseology. The messages in
that study were short ATC clearances and cockpit warnings and advisories. For
both human and synthesized speech, pilots familiar with the type of
phraseology, but not the particular message set, understood the cockpit
messages better than did a control group of nonpilot police patrolmen, even
though both groups were equally able to understand messages relating to
commonly experienced events for both human and synthesized speech.

The wording of cockpit voice communications - including voice warnings -
must be chosen such that the messages are understood accurately, rapidly, with
few repetitions, and with as little demand on pilot mental workload and
attention as possible. The present study was designed to determine whether, for
airline pilots presented with synthesized speech cockpit warnings, the
influence of familiarity with flight phraseology would be so strong as to
outweigh the advantage that the sentence context was found to have in the
Miller, Heise, and Lichten study (ref. 2). If short sentences, with
their additional linguistic redundancy, fulfilled the requirements listed
above better than did the key-word-only messages, then the longer,
sentence-length format would be preferable for cockpit voice warnings.

An answer to this question is crucial for the following reasons.

Brevity has been highly valued for speech communications in the cockpit
because it is believed to reduce the length of the comprehension period. It
does reduce the probability of temporal overlap with other auditory events.
Also, brevity of wording conforms to the style of current cockpit
communications with ATC and inside the cockpit for checklists and crew conversations.
Finally, for warnings presented in digitized speech, each additional word
represents additional cost in terms of dollars and additional read-only memory.
A corresponding additional hardware and memory cost for extra words for
synthesized speech is not a problem, since additional words are obtained via
software modifications and memory increments on the order of only 100 bits per
spoken word. The human factors "cost" for using brevity in cockpit voice
warnings may be low intelligibility and high demands on pilot attention for
comprehension.
For a comparison of the effectiveness of key-word-only messages to
sentence-length messages, airline pilots listened to both types of message
wording with and without competing weather broadcast. The study was divided
into two phases. For Phase 1, the pilots were given no familiarization with
the message set before testing and received no feedback on successive trials.
For Phase 2, using the same messages, pilots first obtained familiarization
with the warnings by listening to them while reading them from a printed list.
Then the pilots were tested for their ability to recognize the warnings with
and without competing weather broadcast.


Subjects 


Sixteen airline pilots participated in Phase 1. Seven of them returned
5 months later for Phase 2, along with an eighth pilot who had participated in
earlier voice warning research using the same speech synthesizer. The pilots'
ages ranged from 31 to 44 years. They were paid for their participation.


Test  Materials 

Sixteen  two-word  warnings  relating  to  airline  transport  operations  were 
composed.  Eight  messages  contained  monosyllabic  words  and  eight  contained 
polysyllabic  words.  For  each  two-word  warning,  an  appropriate  sentence-length 
message  was  constructed  that  contained  the  same  two  key  words  presented  in  the 
same  order  as  in  the  two-word  version.  Both  versions  of  the  messages  are 
presented  in  table  1. 

TABLE 1.- COCKPIT WARNING MESSAGES AND MESSAGE DURATIONS IN SECONDS.

    MONOSYLLABIC WORDS       DURATION   MONOSYLLABIC WORDS IN SENTENCES              DURATION
 1. FUEL LOW                 0.56       THE FUEL PRESSURE IS LOW                     1.63
 2. GEAR DOWN                0.52       THE LANDING GEAR IS DOWN                     1.75
 3. CHECK FLAPS              0.66       CHECK YOUR FLAPS SETTING                     1.90
 4. BRAKES FAILED            0.6?       THE WHEEL BRAKES HAVE FAILED                 1.21
 5. MAIN DOOR                0.59       THE MAIN DOOR IS NOT LOCKED                  1.28
 6. BOOST PUMP               0.54       THE FUEL BOOST PUMP IS OUT                   1.39
 7. OIL HOT                  0.57       THE OIL TEMPERATURE IS HOT                   1.59
 8. ICE VALVES               0.47       THE WING ANTI ICE VALVES ARE INOPERATIVE     2.19
    MEAN                     0.57       MEAN                                         1.62

    POLYSYLLABIC WORDS       DURATION   POLYSYLLABIC WORDS IN SENTENCES              DURATION
 1. AUTOPILOT DISENGAGED     0.98       THE AUTOPILOT IS DISENGAGED                  1.36
 2. DME MALFUNCTIONING       1.52       YOUR DME IS MALFUNCTIONING                   2.15
 3. CABIN PRESSURE           0.61       THE CABIN PRESSURE IS LOW                    1.68
 4. SPOILERS INOPERATIVE     1.18       YOUR SPOILERS ARE INOPERATIVE                2.23
 5. TERRAIN CLOSURE          0.86       YOU HAVE RAPID TERRAIN CLOSURE               2.18
 6. CARGO FIRE               0.74       YOU HAVE CARGO COMPARTMENT FIRE              2.42
 7. WINDOW OVERHEATED        0.65       THE COCKPIT WINDOW IS OVERHEATED             1.51
 8. HYDRAULIC FAILURE        1.05       YOU HAVE HYDRAULIC SYSTEM FAILURE            1.89
    MEAN                     0.95       MEAN                                         1.93
The warnings were synthesized on a VOTRAX VS-6 speech synthesizer¹ using
identical phoneme coding for the key words in their two-word and in their
sentence-length versions. This coding ensured that the key words were
pronounced identically in both types of message wording. A tape of the messages
can be obtained from the author.

Continuous  weather  broadcast,  used  as  background  "noise",  was  recorded 
from  a  local  weather  station  operated  by  the  National  Weather  Service.  This 
recording  was  then  edited  to  eliminate  pauses  in  the  speech. 


Testing  Environment 

Pilots were tested individually while sitting in a sound-treated booth.
The warning messages and the weather broadcast were individually adjusted for
desired peak sound pressure level (SPL)² and fed through an audio mixer. The
summed output of the mixer was amplified and presented to the pilots
binaurally via Koss ESP-9 electrostatic headphones. The pilots sat at a table
which contained a response box with a button for signaling message
comprehension (used in Phase 1) and a push-to-talk microphone for message readback
(used in Phase 2). A PDP-12 computer was programmed to deliver the weather
broadcast and the warnings and to record response times.


Experimental  Conditions 

Phase 1- For Phase 1, the pilots had no prior experience with the message
set. Each pilot was assigned a message set containing eight two-word warnings
and eight sentence-length warnings. Pilots were alternately assigned to one
of two groups. Group 1 pilots were given two-word monosyllabic warnings and
sentence-length warnings containing polysyllabic words. Group 2 pilots heard
two-word polysyllabic warnings and sentence-length warnings containing
monosyllabic key words. In this way, each pilot was tested on all 16 pairs of key
words but no pilot received the same key-word pairs in both the two-word and
the sentence-length versions. For a baseline comparison of the ability of the
two groups to understand the speech produced by the synthesizer, all the
pilots were given a pre-test consisting of 16 unfamiliar short ATC clearances
and a post-test containing 16 unfamiliar cockpit advisories, without competing
weather broadcast.

Each pilot was given repeated trials on his entire message set of eight
two-word and eight sentence-length warnings at seven levels of increasing
signal-to-weather broadcast (S/W) peak SPL ratios of -8, -5, -3, 0, +3,
+5, and +8 dB. The weather broadcast was held constant at 60 dB SPL, and the
SPL of the warnings was varied from 52 dB to 68 dB to obtain the desired S/W.
For the eighth and final trial, the 16 warning messages were presented at
60 dB SPL without weather broadcast. The experimental conditions for Phase 1
are shown in Table 2.
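
The relation between the stated levels can be checked with a few lines of arithmetic: with the weather broadcast fixed at 60 dB SPL, the warning SPL for a given S/W is simply the weather SPL plus the S/W in dB. The sketch below is illustrative only; the function and variable names are assumptions and do not come from the study.

```python
# Phase 1 levels as stated in the text (names are illustrative).
WEATHER_SPL_DB = 60                      # weather broadcast held constant
SW_RATIOS_DB = [-8, -5, -3, 0, 3, 5, 8]  # S/W for trials 1 through 7


def warning_spl(sw_db, weather_spl_db=WEATHER_SPL_DB):
    """Return the warning SPL (dB) that yields the requested S/W ratio."""
    return weather_spl_db + sw_db


# Warning SPL for each of the seven trials with competing weather broadcast.
levels = [warning_spl(sw) for sw in SW_RATIOS_DB]
```

The resulting levels span 52 dB to 68 dB, which agrees with the range given in the text.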


¹Invented by Richard T. Gagnon and manufactured by Vocal Interface
Division of Federal Screw Works, Troy, Michigan.

²Referenced to 0.0002 dyne/cm².
TABLE 2.- EXPERIMENTAL CONDITIONS FOR PHASE 1.

                        8 TWO-WORD WARNINGS        8 SENTENCE WARNINGS
 GROUP 1                MONOSYLLABIC KEY WORDS     POLYSYLLABIC KEY WORDS IN SENTENCES
 (8 airline pilots)
 GROUP 2                POLYSYLLABIC KEY WORDS     MONOSYLLABIC KEY WORDS IN SENTENCES
 (8 airline pilots)

 LISTENING CONDITIONS
 TRIAL NO.           1    2    3    4    5    6    7    8
 WARNING SPL (dB)    52   55   57   60   63   65   68   60
 WEATHER SPL (dB)    60   60   60   60   60   60   60   -
 S/N (dB)            -8   -5   -3   0    +3   +5   +8   ∞

 Sound pressure levels in decibels re 0.0002 dyne/cm².

The order of presentation of the two-word messages and the sentence-length
messages was balanced across pilots so that half the pilots in each group heard
sentences before the two-word format for all trials and the remainder heard
two-word format before sentences. The messages were presented to each pilot in a
different random order for each trial. As they arrived for testing, pilots were
assigned alternately to either the sentences-first order or to the words-first
order.

For trials 1 through 7, continuous weather broadcast occurred throughout
each trial. For each SPL, warning messages were presented one at a time at
40-sec intervals. Pilots responded by pressing a button labeled "Understand"
as soon as they believed they understood each message. They then wrote the
message on a prepared answer sheet. If they did not understand a message, they
were instructed to think about it for a while and then press the button and
either write their best guess or the word "no." Response times from the end of
each message to the pilot's "Understand" button depression were recorded. The
procedure for the eighth and final trial and for the pre- and post-tests was the
same except that no competing weather broadcast was used and the SPL for all
messages was 60 dB.


Phase 2- The same messages used in Phase 1 were also used in Phase 2.
Four pilots were assigned to the monosyllabic two-word messages and the
polysyllabic sentence-length messages. The other four pilots heard the
polysyllabic two-word messages and the monosyllabic sentence-length messages.
Assignment of pilots to message sets was such that the seven pilots who had
participated in Phase 1 heard the opposite message set in Phase 2 from the one
they had heard in Phase 1. As in Phase 1, the order of words-first versus
sentences-first was balanced across pilots. Each pilot was given a different
presentation order for each of three listening conditions. The first listening
condition was a familiarization session. While looking at a printed message
list, the pilot heard each message presented twice in succession. Next, he
received a recognition test in which the messages were presented only once at
[...] in a different order and with no competing weather broadcast. The pilot
responded by keying a microphone and reading back each message as rapidly as
possible, then writing it down on an answer sheet. Finally, he was tested on
the same messages presented in a third order and embedded in weather broadcast
at a S/W of 0 dB. Again, the pilot responded by keying a microphone and then
reading back and writing down the message. The weather broadcast and warning
message were delivered concurrently, with the weather leading the onset of the
warning by approximately 950 msec and continuing until the end of the warning.
An interval of 40 sec was used to reduce the chances that the pilots would be
prepared for successive presentations. All messages and the weather broadcast
were delivered at approximately 50 dB SPL. Table 3 shows the experimental
conditions for Phase 2.

TABLE 3.- EXPERIMENTAL CONDITIONS FOR PHASE 2.

                        8 TWO-WORD WARNINGS     8 SENTENCE WARNINGS
 4 AIRLINE PILOTS       MONOSYLLABIC WORDS      POLYSYLLABIC WORDS IN SENTENCES
 4 AIRLINE PILOTS       POLYSYLLABIC WORDS      MONOSYLLABIC WORDS IN SENTENCES

Figure 1.- Mean articulation scores (percent correct) for the two
Pilot Groups for synthesized speech Pre- and Post-tests.
N = 8 pilots per group. (Legend: standard deviation; mean scores for
pilots that returned for Phase 2.)

TABLE 4.- COMPARISON OF RESPONSE TIMES FOR PRE- AND POST-TEST


RESULTS 


Group  Differences 

Wilcoxon's Sum of Ranks for the Pre-test (R = 68, n1 = 8, n2 = 8, p > 0.10)
and the Post-test (R = 68, n1 = 8, n2 = 8, p > 0.10) indicated that the two
groups of pilots were not significantly different in their ability to
understand the synthesizer. Means and standard deviations for the two groups
for pre- and post-test articulation scores (percent correct) are shown in
Figure 1. However, the two groups were significantly different (t = 2.27,
df = 14, p < 0.05) in average response times to the pre-test messages. They
were not significantly different (t = 1.16, df = 14, p > 0.10) in average
response times for the post-test messages. Table 4 shows the means and standard
deviations in response time for the two groups for the pre- and post-test.
Because of the initial response time difference between the two groups, which
may have continued for an indeterminable number of trials on the test messages,
the response time data were normalized using individual subjects' means and
standard deviations across message type so that between-group comparisons could
be made for the same key-word pairs presented in isolation and in sentence
context.
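
The per-subject normalization described above can be sketched as a standard z-score transform. This is a minimal illustration, not the authors' procedure: the function name is hypothetical, and the use of the population standard deviation is an assumption, since the text does not say which form was used.

```python
from statistics import mean, pstdev


def z_scores(times):
    """Normalize one pilot's response times to zero mean and unit spread.

    Assumption: population standard deviation (pstdev); the source does
    not state whether the population or sample form was used.
    """
    m = mean(times)
    s = pstdev(times)
    if s == 0:
        # All times identical: every score sits at the pilot's own mean.
        return [0.0 for _ in times]
    return [(t - m) / s for t in times]


# Hypothetical response times (seconds) for one pilot across messages.
example = z_scores([1.0, 2.0, 3.0])
```

Because each pilot is scaled by his own mean and standard deviation, a generally slower group no longer biases the comparison between message types.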


Presentation  Order 

Analysis using Wilcoxon's Sum of Ranks determined that presentation of
sentences or words first had no significant effect on the articulation scores
for words (R = 59.5, n1 = 8, n2 = 8, p > 0.10) or for sentences (R = 68,
n1 = 8, n2 = 8, p > 0.10). Similarly, there were no significant presentation
order effects on response times to words (R = 52, n1 = 8, n2 = 8, p > 0.10) or
on response times to sentences (R = 58, n1 = 8, n2 = 8, p > 0.10).

Figure 2.- Mean articulation scores for key monosyllabic words in unfamiliar
synthesized speech cockpit warnings with and without a sentence context, heard
in background of continuous weather broadcast for repeated trials at
increasing signal-to-"noise" ratios (Phase 1) and for the same messages with
and without competing weather broadcast after prior learning of the message
set (Phase 2).

Figure 3.- Mean articulation scores for key polysyllabic words in unfamiliar
synthesized speech cockpit warnings with and without a sentence context, heard
in background of continuous weather broadcast for repeated trials at
increasing signal-to-"noise" ratios (Phase 1) and for the same messages with
and without competing weather broadcast after prior learning of the message
set (Phase 2).
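
For readers unfamiliar with the rank-sum statistic, the sketch below shows one common way to compute it, assuming R denotes the sum of the pooled-sample ranks of one group, with tied values given their average rank. The function names are illustrative, and the source does not specify its tie handling.

```python
def rank_sum(group_a, group_b):
    """Sum of the ranks of group_a in the pooled sample (midranks for ties)."""
    pooled = sorted(group_a + group_b)
    midrank = {}
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j] == pooled[i]:
            j += 1
        # Values at sorted positions i..j-1 hold ranks i+1..j; use their average.
        midrank[pooled[i]] = (i + 1 + j) / 2
        i = j
    return sum(midrank[x] for x in group_a)


def expected_rank_sum(n1, n2):
    """Expected rank sum of the first group when the groups do not differ."""
    return n1 * (n1 + n2 + 1) / 2
```

Under this reading, two groups of eight have an expected rank sum of 8 x 17 / 2 = 68 when there is no group difference, so a reported R = 68 lies exactly at the null expectation.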


Articulation  Scores 

Articulation scores are the percent of key words correctly identified for each
message wording type.

Monosyllabic words- Figure 2 shows a between-groups comparison of articulation
scores for monosyllabic key words in the two-word and sentence-length versions.
The eight trials at increasing S/W for Phase 1 are on the right, and the two
testing trials of Phase 2 are on the left. The higher mean articulation scores
indicated that for unfamiliar messages, the key words in a sentence context
were significantly more intelligible than the same key words presented alone
across all S/W conditions, significance being determined by Wilcoxon's Signed
Ranks Test (R = 0, no. of pairs (np) = 8, p < 0.01). In Phase 2, when pilots
were given prior familiarization with the message set and tested for
recognition of the "learned" messages, all pilots scored 100 percent
articulation for all messages of both types of wording when no competing
weather broadcast was present. With weather broadcast present, mean
articulation for words presented alone dropped to 78 percent, but only dropped
to 94 percent when the same words were presented in a sentence context. This
difference, however, did not reach significance as measured by Wilcoxon's
Signed Ranks Test (R = 2.5, np = 6, p > 0.10). Only six pairs were used because
the difference scores within two of the pairs were zero.
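
The handling of zero-difference pairs mentioned above can be made concrete with a small sketch of the signed-ranks computation. The scores below are made up for illustration and are not the study's data. The function name is hypothetical, and taking R as the smaller of the two signed rank sums is an assumption about the convention used.

```python
def signed_rank_statistic(pairs):
    """Return (R, np): the smaller signed rank sum and the number of pairs used.

    Pairs with a zero difference are dropped before ranking.
    """
    diffs = [b - a for a, b in pairs if b != a]
    ordered = sorted(diffs, key=abs)
    ranks = [0.0] * len(ordered)
    i = 0
    while i < len(ordered):
        j = i
        while j < len(ordered) and abs(ordered[j]) == abs(ordered[i]):
            j += 1
        for k in range(i, j):
            # Tied absolute differences share the average of ranks i+1..j.
            ranks[k] = (i + 1 + j) / 2
        i = j
    positive = sum(r for d, r in zip(ordered, ranks) if d > 0)
    negative = sum(r for d, r in zip(ordered, ranks) if d < 0)
    return min(positive, negative), len(diffs)


# Hypothetical (words-alone, words-in-sentence) percent-correct pairs.
scores = [(75, 100), (80, 90), (100, 100), (90, 85),
          (70, 95), (100, 100), (60, 80), (85, 100)]
result = signed_rank_statistic(scores)
```

In this made-up example two of the eight pairs have zero difference, so only six pairs enter the ranking, mirroring the np = 6 reported for the Phase 2 comparison.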

Polysyllabic words- The mean articulation scores for unfamiliar polysyllabic
words with and without sentence context in Phase 1 are similar to those for the
monosyllabic words at S/W of +3 dB and lower, as shown in figure 3. Across all
S/W conditions, however, the addition of a sentence context did not
significantly improve (R = 3, np = 8, p > 0.10) intelligibility for
polysyllabic key words in initially unfamiliar messages. For previously learned
warnings (Phase 2), articulation scores were again 100 percent for both types
of wording presented with no competing weather broadcast. When weather
broadcast was presented at S/W of 0 dB, articulation scores for words presented
alone dropped to 92 percent and articulation for words in sentences dropped to
81 percent. The difference between articulation scores for words alone and in a
sentence context was again not significant (R = 6.5, np = 6, p > 0.10).

Figure 4.- Response times for Pilot Group 1 for cockpit warnings: Monosyllabic
words alone and Polysyllabic words in sentences presented for repeated trials
at increasing signal-to-weather ratios. Response times measured from end of
warning to pilot's "Understand" signal (Phase 1).

Figure 5.- Response times for Pilot Group 2 for cockpit warnings: Polysyllabic
words alone and Monosyllabic words in sentences presented for repeated trials
at increasing signal-to-weather ratios. Response times measured from end of
warning to pilot's "Understand" signal (Phase 1).


Response  Times 

The response-time data closely paralleled the intelligibility data in Phase 1.
The response times were measured from the end of each message to the time when
the pilot pressed the "Understand" button and began his response, regardless of
accuracy. For Pilot Group 1 (fig. 4) and Pilot Group 2 (fig. 5), the mean
response time for the sentence-length messages was significantly shorter
(Group 1: R = 0, np = 7, p < 0.05; Group 2: R = 3, np = 8, p < 0.05) than the
mean response time for the two-word messages. For Group 1, response time data
were available for only 7 of the 8 pilots since one pilot misunderstood the
instructions and pressed the "Understand" button after he finished his
responses.

Included in the response time data are several types of responses:

1) incorrect response,

2) "unable" response,

3) message correct for the first time,

4) message correct on a previous trial and correct again on the current
trial.

It would be reasonable to expect differences in response times as a function of
the type of response the pilot made. Therefore, mean response times were
calculated only for responses of type 4, that is, for messages which a given
pilot had understood correctly and had also understood correctly on the
previous trial at a lower S/W. The assumption was that for the small message
set used in this study, once a pilot correctly understood a particular warning,
he was likely to remember it during the next trial and so facilitate the
recognition process. The response time to such "previously understood and now
familiar" messages should allow a valid comparison between words presented in
isolation and those presented in a sentence context, for familiar messages. In
general, the response times to the familiar, correctly recognized warnings were
shorter for both Pilot Groups (figs. 6(a) and 6(b)) than were the response
times to all messages regardless of accuracy (figs. 4 and 5).

Figure 6.- Response times for cockpit warnings recognized correctly and
recognized correctly on previous trial for repeated trials at increasing
signal-to-weather ratios (Phase 1).
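
The selection of type 4 responses described above amounts to a filter over consecutive trials. The sketch below shows one way such a filter might look; the data layout (one dictionary per trial, mapping a message label to a correctness flag and a response time) and all names are assumptions made for illustration, not the format used in the study.

```python
def familiar_response_times(trials):
    """Collect response times for messages correct on both the current
    trial and the immediately preceding trial (type 4 responses).

    trials: list ordered from lowest to highest S/W; each entry maps a
    message label to a (correct, response_time_sec) tuple.
    """
    selected = []
    for previous, current in zip(trials, trials[1:]):
        for message, (correct, response_time) in current.items():
            if correct and message in previous and previous[message][0]:
                selected.append(response_time)
    return selected


# Hypothetical data for one pilot over three successive trials.
pilot_trials = [
    {"fuel": (False, 5.0), "gear": (True, 3.0)},
    {"fuel": (True, 4.0), "gear": (True, 2.0)},
    {"fuel": (True, 1.5), "gear": (False, 6.0)},
]
familiar = familiar_response_times(pilot_trials)
```

Responses on the first trial can never qualify, since there is no previous trial on which the message could have been understood.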


Because of group differences in mean response times, each pilot's response
times were normalized. The resulting z-scores were then used for between-group
comparisons of the effects of sentence context on monosyllabic (fig. 7) and
polysyllabic (fig. 8) words. In each case, the response times for each S/W were
significantly longer (Monosyllabic words: R = 0, np = 8, p < 0.01; Polysyllabic
words: R = 0, np = 8, p < 0.01) for the two-word messages than for the
sentence-length messages. The size of the response time difference diminished
slightly for the polysyllabic words at positive S/W conditions, but the
difference occurred at all S/W conditions for both mono- and polysyllabic words.

Four-way  Comparison  of  Message  Types 

In addition to the major comparisons of articulation scores and response
times for the mono- and polysyllabic words with and without a sentence context,
it is interesting to note that pilot performance for monosyllabic words
presented alone (MW) is poorer than performance for the other three
presentation types: monosyllabic words in sentences (MS), polysyllabic words
alone (PW), and polysyllabic words in sentences (PS). Peak intelligibility for
MW was only 8[?] percent, compared with [...] percent for MS (fig. 2) and
95 percent and 97 percent for PW and PS, respectively (fig. 3). Similarly, the
z-scores for response times to familiar messages for the last trial were well
above the mean for MW (+0.46) and below the mean for [...] PS (-0.55) (fig. 8).
These data indicate that response times to the monosyllabic words presented
alone were longer than response times to polysyllabic words presented alone and
to both monosyllabic and polysyllabic words presented in sentences.

Figure 7.- Z-scores for response times for "familiar" cockpit warnings
containing monosyllabic words, with and without sentence context, presented for
repeated trials at increasing signal-to-weather ratios (Phase 1). (Horizontal
axis: signal-to-weather ratio, dB.)

Figure 8.- Z-scores for response times for "familiar" cockpit warnings
containing polysyllabic words, with and without sentence context, presented for
repeated trials at increasing signal-to-weather ratios (Phase 1). (Horizontal
axis: signal-to-weather ratio, dB.)

DISCUSSION 


The reader is cautioned against interpreting the changes in intelligibility and
response time as a direct function of S/W, since S/W is confounded with
learning. The experimental paradigm used in this study makes use of the
modified method of limits, also known as the ascending method of limits
(ref. 3). It is suited for obtaining only the relative thresholds for some
variable, such as S/N ratio, for two or more classes of stimuli which are to be
identified by the listener without knowledge of the total set of response
alternatives. By presenting repeated trials on alternating lists of the
different classes of stimuli at increasing intensities, durations, or S/N
ratios, the relative thresholds for the different classes of stimuli under
equal conditions of familiarity can be obtained while minimizing the effects of
subject familiarity with the individual stimuli. If anything, the effects of
increasing familiarity with the message set would be expected at higher levels
of S/W to reduce the chances of significant intelligibility and response-time
differences between the two levels of linguistic redundancy for message
wording.

Nevertheless, intelligibility scores are higher and reaction times are shorter
for synthesized speech warnings presented as short sentences than they are for
warnings presented in the briefer two key-word format. For listeners who are
experienced with cockpit phraseology, this relationship is true across a wide
range of signal-to-weather broadcast ratios. The advantage of the sentence
context is larger for monosyllabic words than for polysyllabic words in that,
for polysyllabic words, the effect of the added linguistic redundancy decreases
or perhaps disappears at S/W of +3 dB or higher. Familiarity with individual
messages decreases reaction time and increases intelligibility but does not
override the facilitating effect of the sentence context for recognition and
response time. Monosyllabic words alone produce consistently poorer performance
in terms of lower intelligibility and longer response times than do
polysyllabic words alone and both monosyllabic and polysyllabic words in
sentences. This pattern of performance relationships is consistent [...]
"make sense" when additional syllables are included.

It would seem, then, that the additional linguistic redundancy provided by the
sentence context, and to some extent by the additional syllables in the
polysyllabic words, helps pilots understand synthesized warning messages more
accurately and more quickly than when the two-word format is used. This
conclusion is further supported by a subsequent study, also presented at the
12th Annual Conference on Manual Control [...] and using time estimation as a
secondary task, in which [...] redundant sentences required less attention for
[...]. It was also found that total listening time - measured from the [...] of
the warning, with the warnings repeated until [...] the message to signal that
he understood - was shorter for sentences than for two-word messages, because
the pilots allowed the two-word warnings to repeat [...] than the
sentence-length warnings. This difference [...] ratio of sentence to two-word
warning-message durations. The operational implications of this finding are
strengthened by the fact that the actual duration of any sentence warnings
synthesized for use in [...] would be slightly shorter than the durations used
in this study. The durations of the key words in this study were made identical
for the words-alone and the words-in-sentences versions of the warnings so as
to [...] of linguistic redundancy. In human speech, the pronunciation of all
but the final words in [...] is actually shorter than when the words are
pronounced in isolation (ref. 6). Additionally, the shortening of the key words
in the sentences would [...] timing rhythm of the sentence pronunciation [...]
in still higher sentence intelligibility [...] of the perceptual importance of
proper timing relationships among the segments of sentences (ref. 7).

If the increased recognition accuracy and decreased response time for the
sentences is taken as an indicator of less cognitive [...] sentences, then it
is [...] rapidly, possibly with only a single repetition.

This suggestion is supported by judgments from the eight pilots who
participated in Phase 2. They were asked to judge the difficulty of learning
and recognizing the two types of message wording in the experimental situation
and to give their preferences for message wording in the cockpit. While most of
the pilots found both types of wording equally easy to learn and to recognize
in the absence of competing speech, five of the eight thought that the
sentences were easier to recognize during simultaneous weather broadcast, and
six of the eight would prefer the sentence-type wording over the two-word
format for cockpit warnings (table 5). Their reasons for this preference were
obtained in response to an open-ended question, "Why do you think [...] were
easier than the [...]?" The reasons the pilots gave for preferring sentences to
two-word messages fell into three general categories.

1) The extra words in the sentence help you figure out the words you missed.

2) The first words of the sentence get your attention and cue you in to [...]
so that you are listening for the critical words.
TABLE 5.- AIRLINE PILOT PREFERENCES FOR COCKPIT WARNING MESSAGE WORDING:
TWO KEY WORDS VS A SENTENCE (PHASE 2).

                                   TWO-WORD   SENTENCE
                                   FORMAT     FORMAT     NO
                                   EASIER     EASIER     DIFFERENCE    N

MESSAGE LEARNING                      2          0           6         8
RECOGNITION WITH NO COMPETING WX      1          2           5         8
RECOGNITION DURING COMPETING WX       0          5           3         8
PREFERENCE IN COCKPIT                 2*         6           0         8

*THE TWO PILOTS WHO PREFERRED THE TWO-WORD FORMAT FOR USE IN THE
COCKPIT THOUGHT THE LONGER MESSAGES WOULD DISTRACT THEM FROM
OTHER FLIGHT TASKS. BOTH FOUND THE FORMAT WARNING + KEY WORD + KEY
WORD [...]


TABLE 6.- AIRLINE PILOT RESPONSES TO A PROPOSED COCKPIT VOICE-WARNING
SYSTEM (PHASE 2).

REPLACE ALL CURRENT AURAL WARNINGS WITH VOICE

AUTOMATIC VOLUME CONTROL (AVC) TO MAINTAIN CONSTANT S/N RATIO

PRIMARY DELIVERY VIA HEADSETS AT LEVEL EQUAL TO INDIVIDUAL PILOT
SELECTED COMMUNICATIONS AUDIO LEVEL

BACK-UP DELIVERY VIA SPEAKER USING AVC TO MAINTAIN CONSTANT SIGNAL TO
COCKPIT NOISE LEVEL

PILOT PREFERENCE

                            YES                          NO
                 ELIMINATE ALL       KEEP FIRE   VOICE OK
                 BELLS, HORNS,       BELL ONLY   FOR SOME    NO VOICE    N
                 CLACKERS, BUZZERS               WARNINGS

NO. OF PILOTS          4                 3           1           0       8

3) The longer pattern of the sentence with extra words between the
critical ones gives you more time to understand the words.

The two pilots who preferred two-word warnings expressed a preference for
brevity of verbal communications. One thought that the sentences, being
longer, would distract his attention more from his flight tasks (but see
ref. 5). Both these pilots, however, thought that a warning message format
would be acceptable if it consisted of an alerting word such as "warning"
followed by a sequence of key words.

These eight pilots generally favored the concept of cockpit voice
warnings. Table 6 shows their responses to a question proposing that voice
warnings replace all current aural warnings but with the provision that
the voice warnings be presented at a comfortable listening level adjusted
to individual pilot preference and with back-up via speaker and automatic
adjustment of voice warning volume relative to the changing cockpit noise
level. Seven of the eight pilots favored voice for all current aural
warning situations. Three of the seven wanted to keep the fire bell and
have the voice announce the engine number. The eighth pilot said that
voice would be acceptable for some warnings.

TABLE 7.- AIRLINE PILOT PREFERENCES FOR REPETITION OF COCKPIT VOICE
WARNINGS (PHASE 2).

SHOULD VOICE WARNINGS BE REPEATED?

                 YES    NO     N
                  8      0     8

NUMBER OF REPEATS FOR COCKPIT VOICE WARNINGS

                                   UNTIL CANCELLED   UNTIL PROBLEM
                                   OR PROBLEM        CORRECTED =
                1      2 OR 3      CORRECTED         NOT CANCELLABLE     N

CRITICAL        0   [illegible]    [illegible]       (1) LANDING GEAR    8
                                                     AND FLAPS ONLY
NON-CRITICAL    0        4              4                  0             8

Figure 9.- Airline pilot ratings of clarity of pronunciation of
synthesized-speech cockpit warnings after learning message set and
recognizing warnings with and without competing weather broadcast
(Phase 2).

TABLE 8.- AIRLINE PILOT ACCEPTABILITY OF SYNTHESIZED SPEECH AND DIGITIZED
SPEECH FOR COCKPIT VOICE WARNINGS (PHASE 2).

                                               YES    NO     N
WOULD WANT THIS VOICE IN OWN COCKPIT            7      1     8
WOULD WANT GPWS TYPE VOICE IN OWN COCKPIT       8      0     8

[Figure 10 panels: pilot rating (terrible, poor, medium, good, excellent);
"bothered by voice quality" (yes, no). Dotted lines indicate preferences
of the 7 pilots who returned six months later for testing on the same
messages with prior familiarization with [message set].]

Figure 10.- Airline pilot ratings of pronunciation clarity of initially
unfamiliar synthesized-speech cockpit warnings heard at increasing
signal-to-weather broadcast sound pressure level ratios (Phase 1).

Table 7 deals with warning repetition and cancellation. All eight pilots
thought that the warnings should be repeated more than once. Preferences
for the number of repeats and type of cancellation depended to some extent
on the criticality of the warnings. For critical warnings, seven of the
eight pilots thought that the warnings should be repeated until either a
crew member cancelled them or the situation was corrected, whichever
happened first. One of these seven made an exception to this for the
landing gear and flaps warning, stating that at low altitudes above the
ground, he would not want to be able to cancel this warning until the
problem was corrected.

For noncritical warnings, the pilots were divided in opinion, with four
preferring two or three repeats with automatic cut-off and the other four
preferring to have the warnings repeated until cancelled either by a crew
member or by correction of the problem.

These pilots judged the clarity of pronunciation of the messages produced
on this synthesizer as generally good, and seven of the eight said that
they would find this particular voice acceptable in their cockpit (fig. 9
and table 8). All eight pilots judged the voice quality of the Ground
Proximity Warning System (GPWS) - representing digitized speech -
acceptable but thought that the current GPWS voice is too loud. It is
interesting to compare the responses of these pilots to their responses
after the earlier testing in the unfamiliar message condition of Phase 1.
Figure 10 shows the voice clarity ratings of the 16 pilots in the
Unfamiliar Message Set condition. As a group, they tended to rate this
synthesizer as having medium clarity of pronunciation, and half the group
said they were "bothered by the voice quality." The dotted lines show the
responses of the seven pilots who returned six months later for retesting
on the same messages under the familiar condition. They were
representative of the larger group, as evidenced by their medium ranking
of pronunciation and divided opinion regarding voice quality. Several
factors may have combined to cause them to change their rating of the
pronunciation and to be willing to have this particular voice in their own
cockpit, such as familiarity with the particular messages and experience
listening to the synthesizer voice.


CONCLUDING  REMARKS 


Advantages  seemingly  may  be  gained  from  carefully  designed  cockpit  voice 
warnings.  One  design  principle  that  should  be  followed  in  order  to  ensure  the 
effectiveness  of  this  mode  of  presenting  warning  information  to  pilots  is  to 
include  sufficient  linguistic  redundancy  in  the  message  wording  to  ensure 
rapid  and  accurate  comprehension.  The  context  provided  by  a  short  sentence 
has  been  found  to  improve  accuracy  and  decrease  response  times  to  cockpit 
warnings,  particularly  when  the  key  words  in  the  warning  are  monosyllabic. 

This effect is significant despite a strong facilitating effect of pilots'
familiarity with flight phraseology. Further research will be directed at
assessing the facilitating effects of the different types of linguistic
redundancy - semantic and syntactic - and the alerting and "accent"-cueing
effect of an initial low-information word provided by the sentence context.
These effects will be studied under simulated flight conditions with
varying levels of pilot attention workload.
EFFECTS OF LINGUISTIC REDUNDANCY ON SYNTHESIZED COCKPIT WARNING
MESSAGE COMPREHENSION AND CONCURRENT TIME ESTIMATION

Sandra G. Hart*

University of California, Berkeley, California

Carol A. Simpson**

Ames Research Center, NASA, Moffett Field, California


SUMMARY 


The  relative  amounts  of  attention  required  to  comprehend  and  recognize 
two  types  of  speech  messages  that  differed  in  linguistic  redundancy  were 
evaluated  with  two  concurrent  time  estimation  tasks.  Two-word  and  sentence- 
length  synthesized-speech  cockpit  warning  messages  were  presented  to  12 
commercial  airline  pilots  with  and  without  competing  weather  broadcast.  The 
pilots  had  two  tasks:  a)  to  read  back  and  write  down  the  warning  messages 
and b) to give two types of time estimates. It was found that the
intelligibility of messages presented in a redundant sentence-length format
was higher, and that listening time and number of repetitions were lower,
during the Familiarization Phase. Sentences were also more intelligible
than two-word messages when presented in a background of competing weather
broadcast
for  recognition.  Differences  in  verbal  estimates  of  session  length  and  in 
the  length  of  10-sec  estimates  produced  by  pilots  indicated  that  messages 
presented  in  a  sentence  format  required  less  attention  for  comprehension  than 
did  two-word  messages. 


INTRODUCTION 


It  might  be  expected  that  the  many  different  concurrent  tasks  performed 
in  flight  would  demand  varying  amounts  of  attention  and  mental  workload,  even 
though  such  variation  might  not  be  reflected  in  pilot  performance  on  the 
primary  flying  task.  Two  types  of  time  estimation  tasks  have  been  used  as 
quantitative  measures  of  such  concurrent  task  processing  requirements: 
a) verbal estimation, in which the length of an interval of time that has
been experienced is estimated in minutes or seconds, and b) time
production, in which the duration of an interval of time that is specified
verbally is estimated by reproducing it operationally. Hart (ref. 1) and
Hart and McPherson (ref. 2) reported that the length of time productions
made during compensatory tracking and simulated flight, and the length of
verbal estimates made of intervals filled with tracking, covaried with the
difficulty of the manual control tasks. It was suggested that the length
of verbal and produced time estimates reflected the attention demands of
concurrent activity. In the present study, similar time estimation tasks
were used to assess the attention required for two speech processing
tasks: a) comprehending unfamiliar two-word and sentence-length
synthesized-speech cockpit warnings (Familiarization Phase), and
b) recognizing these messages once they were familiar (Recognition Phase).

*This author's research was supported by NASA Grant NGL 05-046-002 to the
San Jose State University Foundation.

**This author's research was supported by a post-doctoral fellowship from
the National Research Council.


It has been previously reported that words presented in a sentence context
are more intelligible than the same words in isolation, by virtue of the
increased redundancy (refs. 3, 4). Linguistic redundancy refers to a
reduction in the number of interpretations which can be assigned to the
segments of an utterance due to semantic, syntactic, and phonological
constraints such as those provided by a sentence frame. Since the results
of Simpson's research (ref. 3) showed increased intelligibility and
decreased response time as the redundancy in synthesized cockpit warnings
was increased, it seemed reasonable to assume that the attention required
of pilots to comprehend messages initially and recognize them once they
were familiar would be inversely related to redundancy. The present study
was designed to assess, by means of verbal and produced time estimates,
the attention required to comprehend and recognize warnings presented in
either two-word or sentence formats. The operational motivation for this
study was to examine the possibility that increased linguistic redundancy
would reduce the attention required for pilots to process
synthesized-speech warning messages in the cockpit. If warnings could be
worded so as to reduce the attention required for their comprehension,
this could leave more attention available for flying the airplane and
taking proper corrective actions. This savings in attention would be
especially important in emergency situations, where higher-than-normal
attention demands exist.

It  was  predicted  that  less  redundant  two-word  warnings  would  require 
longer  listening  time  and  more  repetitions  for  initial  comprehension  and 
that  they  would  be  less  intelligible  than  messages  presented  in  a  sentence 
format  during  both  the  Familiarization  and  Recognition  Phases.  If  attention 
demands  vary  inversely  with  redundancy,  as  predicted,  pilots  should  increas¬ 
ingly  underestimate  the  passage  of  time  as  message  redundancy  is  progressively 
decreased.  (Underestimation  occurs  when  concurrent  activity  momentarily 
diverts  attention  from  timekeeping  so  clock  time  passes  relatively  unnoticed.) 
Increasing  underestimation  of  time  will  in  turn  result  in  progressively 
shorter  verbal  estimates  and  longer  produced  durations. 
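
The direction of both predictions can be illustrated with a toy model. The
sketch below is only an illustration under an assumed simplification, not
the authors' model: it supposes that subjective time accumulates only
during the fraction of clock time in which attention is on timekeeping.
The function names and the parameter `attention_to_time` are hypothetical.

```python
def verbal_estimate(actual_sec, attention_to_time):
    """Seconds a listener would report for an interval that really lasted
    actual_sec, if only a fraction of it was 'noticed' (assumed model)."""
    if not 0 < attention_to_time <= 1:
        raise ValueError("attention_to_time must be in (0, 1]")
    return actual_sec * attention_to_time


def produced_duration(target_sec, attention_to_time):
    """Clock seconds that pass before the internal count reaches target_sec
    under the same assumed model."""
    if not 0 < attention_to_time <= 1:
        raise ValueError("attention_to_time must be in (0, 1]")
    return target_sec / attention_to_time


# Hypothetical attention fractions: 1.0 = no concurrent task.
for fraction in (1.0, 0.9, 0.8):
    print(fraction,
          round(verbal_estimate(10.0, fraction), 2),
          round(produced_duration(10.0, fraction), 2))
```

Under this assumption, a lower attention fraction gives a shorter verbal
estimate and a longer produced duration, which is the pattern predicted
above for less redundant messages.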
TASK AND PROCEDURE


Subjects 

Twelve  commercial  airline  pilots,  ranging  in  age  from  35  to  45  years, 
served  as  paid  participants  in  the  present  study. 


Test  Materials 


The  intelligibility  of  16  monosyllabic  and  polysyllabic  two-word 
synthesized-speech  warning  messages  was  assessed  by  computing  the  percentage 
of  words  correctly  identified  in  a  pilot  study.  Eight  messages  were  then 
assigned  to  each  of  two  message  sets  so  that  the  average  intelligibility  of 
the sets was approximately equal. A sentence-length warning message was
then constructed from each of the sixteen two-word messages. As shown
in table 1, each sentence-length warning contained the same two words in the
same order as in its corresponding two-word version.

TABLE 1.- COCKPIT WARNING MESSAGES AND THEIR DURATION IN SECONDS

GROUP 1 PILOTS

TWO-WORD FORMAT       DURATION    SENTENCE FORMAT                           DURATION
                       (sec)                                                 (sec)

FUEL LOW                 .55      YOU HAVE HYDRAULIC SYSTEM FAILURE.          1.89
CABIN PRESSURE           .61      THE FUEL BOOST PUMP IS OUT.                 1.39
GEAR DOWN                .52      THE COCKPIT WINDOW IS OVERHEATED.           1.51
DME MALFUNCTIONING      1.52      THE WHEEL BRAKES HAVE FAILED.               1.21
CHECK FLAPS              .66      THE AUTOPILOT IS DISENGAGED.                1.36
SPOILERS INOPERATIVE    1.18      THE OIL TEMPERATURE IS HOT.                 1.59
TERRAIN CLOSURE          .86      THE WING ANTI-ICE VALVES ARE INOPERATIVE.   2.19
CARGO FIRE               .74      THE MAIN DOOR IS NOT LOCKED.                1.28

MEAN DURATION            .83      MEAN DURATION                               1.55

GROUP 2 PILOTS

TWO-WORD FORMAT       DURATION    SENTENCE FORMAT                           DURATION
                       (sec)                                                 (sec)

HYDRAULIC FAILURE       1.05      THE FUEL PRESSURE IS LOW.                   1.13
BOOST PUMPS              .54      THE CABIN PRESSURE IS LOW.                  1.13
WINDOW OVERHEATED        .65      THE LANDING GEAR IS DOWN.                   1.20
BRAKES FAILED            .68      YOUR DME IS MALFUNCTIONING.                 1.60
AUTOPILOT DISENGAGED     .98      CHECK YOUR FLAPS SETTING.                   1.35
OIL HOT                  .57      YOUR SPOILERS ARE INOPERATIVE.              1.68
ICE VALVES               .47      YOU HAVE RAPID TERRAIN CLOSURE.             1.63
MAIN DOOR                .59      YOU HAVE CARGO COMPARTMENT FIRE.            1.87

MEAN DURATION            .69      MEAN DURATION                               1.45
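
As a consistency check on table 1, the mean durations can be recomputed
from the individual message durations. The short sketch below does this;
the dictionary layout and names are illustrative choices, and the last
group 1 sentence duration is taken as 1.28 sec, a reading that agrees with
the printed group mean of 1.55 sec.

```python
from statistics import mean

# Durations in seconds, in the row order of table 1.
TABLE_1 = {
    ("group 1", "two-word"): [0.55, 0.61, 0.52, 1.52, 0.66, 1.18, 0.86, 0.74],
    ("group 1", "sentence"): [1.89, 1.39, 1.51, 1.21, 1.36, 1.59, 2.19, 1.28],
    ("group 2", "two-word"): [1.05, 0.54, 0.65, 0.68, 0.98, 0.57, 0.47, 0.59],
    ("group 2", "sentence"): [1.13, 1.13, 1.20, 1.60, 1.35, 1.68, 1.63, 1.87],
}

# Mean duration per group and format, rounded as in the table.
MEANS = {key: round(mean(values), 2) for key, values in TABLE_1.items()}

for (group, message_format), value in MEANS.items():
    print(f"{group:8s} {message_format:9s} mean duration = {value:.2f} sec")
```

The recomputed means match the MEAN DURATION rows of the table, and the
longest message (2.19 sec) is consistent with the 2.20-sec upper bound
quoted in the Recognition Phase procedure.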

The warnings were synthesized on a VOTRAX VS-6 speech synthesizer using
identical phoneme coding for the key words in both the two-word and
sentence-length message formats. A PDP-12 computer delivered the messages
and recorded the response times.


The background noise used in the Recognition Phase was continuous
weather broadcast recorded from a local station operated by the National
Weather Service and edited to eliminate pauses. The ratio of warning-
message signal peak sound pressure level (SPL) to weather-broadcast noise
SPL was +3 dB.
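
For readers who prefer linear ratios, a level difference in decibels of
sound pressure level converts to a pressure ratio through the standard
relation dB = 20 log10(p1/p2). The sketch below applies it to the +3 dB
figure; the function names are illustrative only.

```python
def db_to_pressure_ratio(level_difference_db):
    """Sound-pressure ratio for an SPL difference (dB = 20*log10 of the ratio)."""
    return 10 ** (level_difference_db / 20)


def db_to_intensity_ratio(level_difference_db):
    """Intensity (power) ratio for the same difference (dB = 10*log10 of the ratio)."""
    return 10 ** (level_difference_db / 10)


print(round(db_to_pressure_ratio(3.0), 2))   # about 1.41
print(round(db_to_intensity_ratio(3.0), 2))  # about 2.0
```

A +3 dB ratio therefore means the warning peaks had roughly 1.4 times the
sound pressure, or about twice the intensity, of the competing broadcast.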


Procedure 

Each pilot was tested individually in a sound-attenuated booth. The
warning messages and weather broadcast were presented to the pilots
binaurally through Koss earphones, Model ESP-9. Pilots were asked to
initiate message readback by keying a hand-held microphone. Subsequent to
verbal readback, pilots were asked to write down on a prepared answer sheet
as much as they had understood of the message. Message readback via keyed
microphone was used to provide pilots with a response mode similar to that
which they normally use for responding to spoken messages in the cockpit.
It was also expected that vocalization would result in shorter response
times than would be obtained using written responses. However, for
intelligibility scoring, a written record of the pilot's response was
required so as to eliminate possible experimenter bias in listening to
recorded spoken responses.

Pilots  made  verbal  time  estimates  during  the  Familiarization  Phase  by 
writing  their  estimate  of  elapsed  time  on  the  answer  sheet.  They  produced 
10-sec  intervals  during  the  Recognition  Phase  by  pressing  one  button  to  begin 
their  estimate  when  cued  to  do  so  by  a  pre-recorded  message  and  a  second 
button  to  terminate  their  production  when  they  believed  that  10  sec  had 
elapsed. 

Half of the pilots were randomly assigned to each of two test groups.
The six pilots in one group received the first set of eight two-word
messages and the eight sentence-length messages constructed from the second
set of two-word warnings. Conversely, the second group of six pilots
received the second set of two-word warnings and the sentence-length
messages constructed from the first set of two-word warnings. The messages
given to each group and their durations can be seen in table 1.

All  twelve  pilots  were  given  three  familiarization  runs  for  each  of 
eight  two-word  messages  and  each  of  eight  sentence-length  messages.  After  a 
30-min  break,  pilots  were  given  one  recognition  run  in  which  each  message 
was  presented  one  time  only  in  a  background  of  continuous  weather  broadcast. 
The  order  of  presentation  of  the  two-word  and  sentence-length  warning  message 
sets  was  balanced  across  subjects  so  that  three  pilots  in  each  group  heard 
sentences  before  two-word  messages  during  all  the  test  runs.  The  other  three 
pilots  in  each  group  heard  the  two-word  versions  first,  and  then  the  sentence- 
length  versions.  The  eight  two-word  and  eight  sentence-length  messages  within 
the  two  sets  were  presented  in  a  different  random  order  for  each  pilot. 
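
The assignment scheme described above can be sketched in a few lines. This
is a hypothetical reconstruction for illustration only: the pilot labels,
the seed, and the dictionary fields are assumptions, and the original
randomization procedure is not described in the text.

```python
import random


def build_schedule(pilot_ids, seed=1976):
    """Assign 12 pilots to two groups, balance format order within each
    group, and draw a random message order per pilot (illustrative)."""
    pilots = list(pilot_ids)
    if len(pilots) != 12:
        raise ValueError("this sketch assumes exactly 12 pilots")
    rng = random.Random(seed)
    rng.shuffle(pilots)  # random assignment to the two test groups
    schedule = {}
    for position, pilot in enumerate(pilots):
        group = 1 if position < 6 else 2
        # Group 1: two-word set 1 plus sentences built from set 2; group 2: reverse.
        two_word_set, sentence_source_set = (1, 2) if group == 1 else (2, 1)
        # Three pilots per group hear sentences first, three hear two-word first.
        first_format = "sentence" if position % 6 < 3 else "two-word"
        schedule[pilot] = {
            "group": group,
            "two_word_set": two_word_set,
            "sentence_source_set": sentence_source_set,
            "first_format": first_format,
            # Random order of the eight messages in each set for this pilot.
            "two_word_order": rng.sample(range(8), 8),
            "sentence_order": rng.sample(range(8), 8),
        }
    return schedule


SCHEDULE = build_schedule([f"pilot {n}" for n in range(1, 13)])
```

Note that independent random draws make identical orders for two pilots
unlikely but do not strictly exclude them; a stricter scheme would reject
and redraw duplicates.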
Familiarization phase- During the Familiarization Phase, pilots heard
each of the 16 warning messages for their group at least 3, but not more
than 30 times to give them an opportunity to understand and become familiar
with as many messages as possible. The eight two-word and eight
sentence-length messages were all presented on each of three
familiarization runs without competing weather broadcast. Within a
familiarization run, each message was repeated after 950 msec intervals
until the pilot keyed his microphone and read back the message. After 10
repetitions of any 1 message, its presentation was automatically terminated
under the assumption that the pilot was not going to understand it on that
run. Following verbal message readback, pilots wrote down whatever they had
understood of each message on a prepared answer sheet. They were also asked
to estimate the amount of time that they had spent listening to that
particular message. The response time, which is the interval between the
beginning of the first message repetition and the time when the pilot keyed
his microphone, was recorded and used to calculate the number of message
repeats and the ratio of estimated to actual time. Pilots were given no
feedback on message comprehension or time estimation accuracy. This
procedure generated the following data: a) response latency, b) number of
message repeats, c) verbal time estimate, and d) pilots' written versions
of the warning.
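
The text does not give the formula used to derive the number of repeats
from the response time, so the following is only one plausible sketch. It
assumes that the 950 msec interval is the silent gap between the end of
one presentation and the start of the next, and that a presentation counts
once it has started; the function names are hypothetical.

```python
import math

GAP_SEC = 0.95           # assumed silent gap between presentations
MAX_PRESENTATIONS = 10   # presentation was terminated after 10 repetitions


def presentations_started(response_time_sec, message_duration_sec):
    """Number of presentations begun before the pilot keyed the microphone."""
    cycle = message_duration_sec + GAP_SEC
    started = math.floor(response_time_sec / cycle) + 1
    return min(started, MAX_PRESENTATIONS)


def estimate_ratio(verbal_estimate_sec, response_time_sec):
    """Ratio of estimated to actual listening time (below 1 = underestimate)."""
    return verbal_estimate_sec / response_time_sec


# Example with the 0.55-sec message "FUEL LOW" and a hypothetical
# response time of 4.0 sec and verbal estimate of 3.0 sec.
print(presentations_started(4.0, 0.55))
print(estimate_ratio(3.0, 4.0))
```

With these assumptions a 4.0-sec response to a 0.55-sec message falls in
the third presentation cycle, and a 3.0-sec verbal estimate gives a ratio
of 0.75.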

Recognition phase- During the Recognition Phase, pilots were asked to
produce eight 10-sec intervals during which no other activity occurred
(baseline) and then sixteen 10-sec intervals during which one of the eight
two-word or eight sentence-length messages that had been presented during
the Familiarization Phase was repeated one time only with a background of
continuous weather broadcast. It was assumed that the majority of the
messages had been comprehended during the Familiarization Phase and that
for these familiar messages, the Recognition Phase of the experiment
involved only simple recognition.

A pre-recorded message cued pilots when to begin each 10-sec production.
By pressing one button, pilots initiated their production and the onset of
the background weather broadcast. Approximately 2 sec later, 1 of the 16
warning messages was presented imbedded in the weather broadcast. Following
warning-message presentation, which lasted no longer than 2.20 sec
(table 1), the weather broadcast continued until pilots pressed a second
button to terminate their 10-sec production. Then, they were asked to key
their microphone and, as quickly as possible, read back and then write down
the warning message. This procedure generated the following data: a) length
of time productions, b) pilots' written versions of the warning, and c) the
length of time pilots waited after the end of their production to begin
message readback (response latency).
RESULTS


Familiarization  Phase 

Overall performance change across runs- Intelligibility, as measured by
the percentage of words correctly identified, increased significantly
(F = 5.12; df = 2,20; p < .025) across the three familiarization runs
(fig. 1(a)). The total time spent listening to each message (fig. 1(b))
decreased significantly (F = 15.50; df = 2,20; p < .001) across runs. The
number of times that messages were repeated (fig. 1(d)) also decreased
significantly (F = 12.81; df = 2,20; p < .001) across runs. The ratio of
overall verbal estimate length to actual time spent listening to messages
(fig. 1(c)) increased nonsignificantly (F = 2.77; df = 2,20; p > .10)
across runs.


[Figure 1 panels: (a) Intelligibility; (b) Response latency; (c) Normalized
ratio of verbal estimates (VE) to response latency (RL); (d) Number of
message repeats. Horizontal axis: successive familiarization runs (1, 2, 3).
W: two-word format warning messages; S: sentence format warning messages.]

Figure 1.- Performance during the familiarization runs
Two-word versus sentence-length format- Warning messages presented in a
two-word format were significantly (F = 13.39; df = 1,10; p < .005) less
intelligible than those presented in a sentence format for all runs
(fig. 1(a)). Note that there were more words to be remembered in the
sentence-length messages than the two-word messages. Even so, the
sentence-length messages were more intelligible as a further consequence of
the added linguistic redundancy.

Pilots allowed the two-word messages to repeat (fig. 1(d)) significantly
(F = 15.32; df = 1,10; p < .005) more often (at least one more time, on the
average) than sentence-length messages. The time spent listening to two-word
messages (fig. 1(b)) was also consistently longer than that spent listening
to sentence-length messages even though each sentence-length message lasted
longer than its corresponding two-word message. This difference was not
statistically significant (F = 2.06; df = 1,10; p > .10); there was a Pilot
Group X Message Format interaction (F = 7.09; df = 1,10; p < .025).

Verbal estimates of the length of time spent listening to two-word
warnings were consistently shorter than the amount of time that had actually
passed, whereas verbal estimates of intervals filled with messages presented
in a sentence format were close to, or slightly longer than, elapsed clock
time (fig. 1(c)). This difference in verbally estimated duration as a
function of message format was statistically significant
(F = [illegible]; df = 1,10; p < .001).

Influence of key word intelligibility- The 16 pairs of key words were
rank ordered on the basis of intelligibility scores across familiarization
runs, and then divided into quartiles that represented increasing
intelligibility. As can be seen in figure 2(a), there was a large difference
in intelligibility scores for two-word messages. Less intelligible two-word
messages were repeated more often (fig. 2(b)) and were associated with the
shortest verbal estimates (fig. 2(c)). These same key words, when presented
in a sentence, were consistently more intelligible (fig. 2(a)). The
facilitation provided by a sentence format was particularly great for the
messages that were the least intelligible when presented in a two-word
format. Sentences were repeated less often (fig. 2(b)) and sentences were
associated with verbal estimates that most closely approximated the amount
of time that had actually elapsed, for all quartiles of key word
intelligibility (fig. 2(c)). The differences between two-word and sentence
formats for all three measures (intelligibility, number of repetitions, and
verbal estimate duration) were statistically significant (p < .002) in each
case, as measured by Wilcoxon's signed ranks test for differences between
16 pairs of means.
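
For readers unfamiliar with the signed ranks test, the sketch below shows
an exact two-sided version for paired means. It is a generic textbook
implementation, not the authors' computation, and the 16 pairs of
intelligibility percentages are hypothetical placeholders, since the
per-message means are not tabulated here.

```python
def signed_rank_test(first, second):
    """Exact two-sided Wilcoxon signed ranks test for paired samples.
    Returns (W, p), where W is the smaller of the two signed rank sums."""
    diffs = [a - b for a, b in zip(first, second) if a != b]  # drop zero differences
    n = len(diffs)
    order = sorted(range(n), key=lambda i: abs(diffs[i]))
    ranks = [0.0] * n
    start = 0
    while start < n:  # average the ranks of tied absolute differences
        end = start
        while end + 1 < n and abs(diffs[order[end + 1]]) == abs(diffs[order[start]]):
            end += 1
        mean_rank = (start + end) / 2 + 1
        for pos in range(start, end + 1):
            ranks[order[pos]] = mean_rank
        start = end + 1
    w_plus = sum(r for r, d in zip(ranks, diffs) if d > 0)
    w_minus = sum(r for r, d in zip(ranks, diffs) if d < 0)
    w = min(w_plus, w_minus)
    # Null distribution: every rank is positive or negative with probability 1/2.
    counts = {0.0: 1}
    for r in ranks:
        updated = {}
        for total, count in counts.items():
            updated[total] = updated.get(total, 0) + count
            updated[total + r] = updated.get(total + r, 0) + count
        counts = updated
    rank_sum = sum(ranks)
    tail = sum(c for s, c in counts.items() if s <= w or s >= rank_sum - w)
    return w, min(1.0, tail / 2 ** n)


# Hypothetical percent-correct means for 16 key-word pairs (not study data).
SENTENCE = [90, 92, 95, 97, 88, 99, 96, 94, 91, 98, 93, 89, 100, 97, 95, 96]
TWO_WORD = [40, 55, 70, 85, 35, 90, 80, 75, 50, 86, 65, 30, 92, 81, 72, 78]
W, P = signed_rank_test(SENTENCE, TWO_WORD)
print(W, P)
```

When all 16 differences favor one format, as in this placeholder data, the
smaller rank sum is zero and the exact two-sided probability is 2/2^16,
well below the .002 level quoted above.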

Influence of message duration- Intelligibility was not correlated with
simple message length. For example, the two-word message "DME
Malfunctioning" was the longest two-word warning (1.5 sec), yet it was the
least intelligible message when presented in a two-word format. The message
"Your DME is Malfunctioning" was, by contrast, one of the most intelligible
messages by virtue of the additional words "your" and "is" in the sentence
context.
[Figure 2 panels: (a) Intelligibility; (b) Number of message repeats;
(c) Normalized ratio of verbal estimates (VE) to response latency (RL).
Horizontal axis: intelligibility quartiles for key words across 3 runs.
W: two-word format for warning messages; S: sentence format for warning
messages.]

Figure 2.- Comparison of two-word versus sentence format for intelligibility
quartiles (based on key word intelligibility for 3 familiarization runs)

Recognition  Phase 

During the one recognition run, the intelligibility scores were higher
than they were during the three familiarization runs, even though each of
the 16 messages was presented only once and with competing weather
broadcast. Two-word message intelligibility increased from 76 percent to
82 percent and sentence intelligibility increased from 95 percent to
96 percent between the last familiarization run and the one recognition run.

Two-word versus sentence-length format- As before, sentences were more
intelligible than two-word messages (fig. 3(a)). Pilots also began sentence
readback more quickly than two-word message readback (fig. 3(b)). Overall,
produced durations lasted 12.55 sec, and were nearly identical for intervals
filled with two-word and sentence-length messages (fig. 3(c)). This
represents an increase of 2.5 sec in production length over the length of
baseline productions made with no warning message presentation.

Influence  of  message  familiarity —  Several  of  the  messages  had  not  yet 
been  completely  understood  at  the  end  of  the  Familiarization  Phase  so  26 
percent  of  the  messages  were  still  unfamiliar  at  the  beginning  of  the  Recog¬ 
nition  Phase.  This  phenomenon  occurred  more  often  for  two-word  than  sentence- 
length  messages  (fig.  4(a)).  For  messages  that  were  still  unfamiliar, 
response  latencies  were  1.0  sec  slower  (fig.  4(b))  and  produced  durations  were 
2.0  sec  longer  (fig.  4(c))  than  were  response  latencies  and  produced  durations 
for  messages  that  had  already  been  understood  and  were  simply  recognized.  The 
1  epot"b  of  produced  durations  that  had  been  filled  with  uUi.aiuij.iar  two— word 
messages  was  1.5  sec  longer  than  the  length  of  productions  filled  with 

[Figure 3 panels: (a) intelligibility; (b) interval between end of production and readback; (c) duration of productions. Horizontal axis in each panel: format in which the synthesized message was presented (two-word or sentence).]

Figure  3.—  Performance  during  the  recognition  phase 


[Figure 4 panels: (a) frequency of occurrence; (b) interval between end of production and readback; (c) duration of productions. Bars in each panel: two-word and sentence formats, shown separately for messages still unfamiliar and for messages familiar at the beginning of the recognition phase.]

Figure  4.—  Performance  during  the  recognition  phase  as  a  function 
of  warning  message  familiarity 
unfamiliar sentence-length messages. For familiar messages, however, produced
durations  filled  with  two-word  messages  were  slightly  shorter  (O.a  sec)  than 
productions  filled  with  sentences. 


Influence  of  Number  of  Syllables 

In  order  to  compare  the  results  of  the  present  study  to  results  by 
Simpson (ref. 3), the influence that the number of syllables had on
intelligibility was also determined for both the Familiarization and Recognition Phases.
While polysyllabic words were not necessarily longer in duration than
monosyllabic words, the former were more intelligible, again suggesting that
linguistic redundancy, rather than simple duration, was the most important
factor in intelligibility. These results confirm the results of Simpson's
research  (ref.  3).  During  familiarization  runs,  both  monosyllabic  and 


[Figure 5 panels: (a) familiarization phase; (b) recognition phase. Legend: monosyllabic words presented in two-word format; polysyllabic words presented in two-word format; monosyllabic words presented in sentence format; polysyllabic words presented in sentence format.]

Figure 5.- The influence of monosyllabic and polysyllabic words
on intelligibility as a function of message format


polysyllabic two-word warnings were less intelligible than the same
monosyllabic and polysyllabic messages presented in sentences (fig. 5(a)). During
recognition, with competing weather broadcast, polysyllabic words increased
two-word message intelligibility to 91 percent. However, there was no
corresponding gain in intelligibility for monosyllabic words (fig. 5(b)).
DISCUSSION


The results of the current research confirm and expand the research
reported by Simpson (ref. 3) with respect to the facilitating influence of
linguistic redundancy on initial comprehension and subsequent recognition.
Messages presented in a sentence format were consistently more intelligible,
required fewer repetitions and less listening time than two-word messages
during familiarization, and were more easily recognized and read back more
quickly during recognition.

Variation in the length of verbal time estimates and productions was
used to assess the attention required to process messages differing in
linguistic redundancy. It has been suggested that active verbal time estimation
requires attention. When attention is diverted from timekeeping by concurrent
activity, time continues whereas subjective timekeeping does not. This
disparity results in an underestimation of elapsed time, so that verbal
estimates of the duration of an interval are too short. As expected, verbal
estimates of the duration of intervals filled with less redundant, less
intelligible, two-word messages were consistently shorter and less accurate
than estimates of intervals filled with sentence-length messages during
familiarization. These very short verbal estimates indicate that comprehending
two-word messages may require more attention than comprehending similar
messages presented in sentences.

Jr^ed'duMtions  that  are  too  long  In  tha  present  study, 
present  Inn  weather  broadcast  and  warning  messages  while  the  tr*  Production 
Ls  in  progress  was  J^^So^TuUd  with 

length  of  W i«  length  to  those  filled  with 

two-word  messages  w.r.e  sentence- length  messages  had  the  potential 

™  onderstanding  hat 

did  not  require  additional  attention. 

rr  is  also  interesting  to  note  that  produced  durations  filled  with  mes- 
”T  Tel  "o  “c  fo^r”  rr  “iUed^r^iooSy  i"™i  «"sages 

So?  ZTnTyZ  been  learned  as  compared  “  ^ 

“s -“r  h  ler inceui- 

s“lPe  “Hi  r^ulre  core  attenrion  when  unfamiliar  than  when  familiar. 
Sentence-length warnings appear to require less attention to comprehend
initially than two-word messages. The former were associated with productions
that were 1.5 sec shorter than those for intervals filled with two-word
messages. However, sentences that had been previously learned had longer
production times than did two-word messages that were familiar. Once a familiar
message had been recognized, pilots had to hold it in memory during the
remainder  of  the  production.  Distraction  from  time  estimation  most  likely 
varied  as  a  function  of  the  number  of  items  held  momentarily  in  memory.  If 
pilots  attended  to  the  passage  of  time  only  at  the  end  of  each  mental  repetition 
of  the  recognized  message,  the  sentences,  being  longer  units,  resulted  in  less 
frequent  attention  to  time,  and  thereby  longer  productions,  than  did  the  two- 
word  warnings. 


CONCLUDING  REMARKS 


From  these,  as  well  as  previously  reported  results,  it  appears  that  ver- 
bal  and  produced  time  estimates  may  provide  a  useful  index  of  the  attention 
demands  of  concurrent  activity.  Factors  associated  with  the  intelligibility 
of  synthesized  warning  messages,  such  as  redundancy  and  familiarity,  pre¬ 
sumably  influenced  the  attention  required  to  process  them.  The  length  of 
verbal  estimates  and  produced  durations  provided  an  indirect  measure  of  the 
amount  of  attention  required  by  messages  differing  in  intelligibility. 

The  results  of  this  research,  and  of  that  reported  by  Simpson  (ref.  3), 
have  demonstrated  the  facilitating  influence  of  linguistic  redundancy,  such 
as  that  provided  by  a  sentence  format,  on  cockpit-warning  comprehension  and 
recognition.  Even  though  sentences  last  longer  and  contain  more  words  to  be 
remembered  than  do  two-word  messages,  the  former  require  fewer  repetitions, 
less  time,  and  less  attention  to  comprehend  initially  and  are  read  back  more 
quickly  and  accurately  once  they  are  familiar. 
INTRODUCTION


The  problem  of  aircraft  control  during  approach  and  landing 
operations  deserves  strong  attention,  as  evidenced  by  aircraft 
accident  statistics.  Approximately  one-half  of  all  aircraft 
accidents  occur  during  such  operations  despite  their  contribut¬ 
ing  only  a  brief  portion  of  the  total  flight  time.  (1) 

A  primary  cause  is  the  heavy  demand  placed  on  a  pilot — 
especially  the  division  of  visual  attention  required  to  control 
the  flight  path  and  airspeed.  During  the  approach  to  landing, 
information  pertaining  to  the  flight  path  is  primarily  obtained 
from  visual  cues  outside  the  cockpit,  while  pitch  information  is 
obtained  via  a  panel-mounted  airspeed  display. 

In addition, just prior to touchdown, the pilot's visual
attention is progressively drawn towards runway-specific cues,
allowing little, if any, use of instrument panel information.

It  is  hypothesized  then,  that  the  use  of  a  non-visual  dis¬ 
play  would  allow: 

(a)  An  alleviation  of  this  unwanted  division  of  visual 
attention,  and 

(b)  the  opportunity  of  presenting  accurate  pitch-command 
information  during  the  roundout  (or  flare)  just  prior 
to  touchdown. 


*  The  efforts  reported  here  were  sponsored  by  the  National  Aero¬ 
nautics  and  Space  Administration  (NASA) ,  Contract  No.  NAS  2-8954. 




This should be especially beneficial to a novice pilot who
lacks the experience to use relevant visual, inertial, and aural
cues. This should result in the following benefits:

(1)  His  task  should  be  simplified 

(2)  His  performance  should  be  improved  and 

(3)  His  learning  of  the  appropriate  visual  cues  from  out¬ 
side  the  cockpit  should  be  enhanced. 

Previous  work  by  Fenton,  Gilson  and  others  (2,  3,  4  &  5)  has 
shown  an  effective  and  practically  implemented  non-visual  dis¬ 
play  for  automobile,  aircraft,  and  helicopter  controls.  Based 
on  their  findings,  the  use  of  a  kinesthetic-tactual  display 
appears to be a natural way of overcoming the unwanted division
of  visual  attention  during  flight,  while  also  avoiding  any  fur- 
ther  division  of  auditory  attention. 

In  the  present  study,  information  pertaining  to  the  aero¬ 
dynamic  state  of  an  aircraft  was  continuously  presented  via  a 
kinesthetic-tactual  display  during  takeoffs  and  landings.  Fig¬ 
ure  1  depicts  the  desired  control  information  during  an  approach 
and landing. The tactual display is programmed to present
critical-pitch directives from the beginning of takeoff roll through
the approach and flare-to-landing. The reference input is the
desired angle-of-attack (AOA) which is, of course, related to the
desired approach airspeed. Whenever the aircraft is higher than
50 feet above the ground the desired angle-of-attack is held
constant. Below 50 feet the desired AOA is an increasing inverse
function of height, so that the aircraft is near stall at liftoff
and at touchdown.

Figure  2  shows  the  control  loop.  The  feedback  signal  is  the 
measured  AOA.  The  display  input  is  the  difference  between  the 
measured and reference (or desired) AOA; thus, a compensatory
tactual  "pitch  director." 
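
The reference schedule and the compensatory error described above can be sketched in a few lines. This is only an illustration: the linear rise of the reference AOA below 50 feet, the 8 and 14 degree values, and the function names are assumptions made for the example, not values reported in the study.

```python
def reference_aoa_deg(height_ft, approach_aoa_deg=8.0, stall_aoa_deg=14.0,
                      flare_height_ft=50.0):
    """Desired AOA: constant above the flare height, rising as height decreases.

    The default AOA values and the linear ramp are illustrative assumptions.
    """
    if height_ft >= flare_height_ft:
        return approach_aoa_deg
    height_ft = max(height_ft, 0.0)
    fraction_of_flare = 1.0 - height_ft / flare_height_ft
    return approach_aoa_deg + fraction_of_flare * (stall_aoa_deg - approach_aoa_deg)


def display_error_deg(measured_aoa_deg, height_ft):
    """Compensatory display input: measured AOA minus reference AOA."""
    return measured_aoa_deg - reference_aoa_deg(height_ft)
```

Under these assumptions a positive error means the measured AOA exceeds the reference, and the pilot's task is to drive the returned value to zero.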

The tactual display was built into the head of a conventional
aircraft control yoke. The display itself, photographed in
Figure 3, consisted of a moveable section of the control grip.
This is shown as protruding from the forward part of the grip and
recessed into the aft part. This protrusion corresponds to an
unwanted increase in angle-of-attack, and a pilot responds by
moving the yoke forward so as to decrease this angle and return
the display to its neutral or flush position. Next, in Figure 4,
is a view of the display protruding "backward," which requires an
aft corrective motion of the control yoke. In essence, the pilot
follows the display commands to reduce errors to zero.

PROCEDURE 

The experimental inflight study considered novice pilot performance
while flying approaches to landing.

Twelve novice pilots received flight instruction with a standard, 3
hour primary training syllabus designed to teach them the basics of
aircraft control (but without allowing practice of landings). At the
completion of this Preliminary Training, the subjects were evaluated
on a standard series of test maneuvers designed to equally match
subjects, by performance, into two groups.

The first experimental phase then commenced, wherein the
subjects flew four one-hour test periods of six takeoffs and
landings. One group, designated the airspeed group, received
only the conventional visual display of airspeed. The other, the
tactual group, received only the tactually displayed information.

The  second  experimental  phase  then  followed,  consisting  of 
4-  ^  Hour  test  periods,  wherein  the  display  conditions  were 

reversed; that is, the airspeed trained group now flew with the
tactual display, while the tactually trained group received
visual airspeed information only. In all the experimental phases,
performance was analyzed during the final approach and landing
from approximately [illegible] miles out to touchdown. Performance
measures included the number of instructor pilot (a) verbal assists,
(b) assisted landings, and (c) actual takeovers during landing
(where safety of flight was involved). Touchdown performance
consisted of: (a) lateral deviations from runway centerline,
(b) longitudinal touchdown position measured from the fixed
distance marker, and (c) sink rate or "g" loading at touchdown.
Other performance measures recorded but not as yet analyzed
include angle of attack, flight control positions, runway lineup,
and glide path error during the final approach.


RESULTS  AND  DISCUSSION 


Preliminary data are shown in Figures 5 and 6. Figure 5
depicts the total number of unassisted landings performed by
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_ test  period.  Unassisted  landings  were  those 
subject  groups  per  test  pe  „  touchdown.  With  6  landings 

expected  not  to  exceed  three  9  s  »t  ^  ^  per 

per  hour  and  6  subjects  per  gre  P.  initially  trained  with  tne 
test  period  were  possib  .  •  9-  P  t_performed  the  airspeed 

tactual  display  significant1'/  ^p<  * Oil  out^^  when 

°traLf«red  ^fl/^the  visual  airspeed  indicator 

only. 


This  suggests  ^at  the  group^nitian^trained^tt^n^ 

tactual  display  was  facii  unassisfced  landings) .  and  that  they 
following  the  display  oerforra  quite  well  (almost  70% 

tad  learned  enough  to  subsSue„tlyePerfo™  display  0Dly. 

unassisted  -and^  ]  initial  hypothesis  that  a  "crutch-like 
The  latter  re^e  might  Svelop  when  this  group  was  trans- 

dependency  on  the  d  P  Y  condition.  On  the  other  hand,  the 
f erred  to  the  visua  -  ^  airspeed  display  performed  like 

group  first  trai  d  approximately  22%  unassisted  land- 

typical  flight  studen  riods.  what  was  remarkable,  however. 

in9SrtTthe"airspeed  group  did  not  significantly  improve  w*en 

they^were^ransferred  to  the  tactual  £*£-  f  ^ 

thf  initial  training  periods  for  the 

tactual  group. 


fviq  lack  in  improvement  is  suggested  in 
interaction  effect  of  group  by  test  period  (p<.05). 


„  ,  .  aooarent  that  when  the  airspeed  group  was  transferred 

t0  “^attnlncr^fX  [^^^0^!  ™ese  dfta 
that-some  apparel 

conf  lic^may^occur  for  the  ^speed^roupjiecause  their^initial 

fratif;yitriai1andnerror.aVsich  strategies  most  likely  differed 
largely  by  tna  +-*rtual  display.  Conversely, 

from  the  ^PP-ch  =ommahded  by  the  a  d?Jay  „as  forced 

S^rr^er^t^rdesired  flare-to-landing^trateg,.  Jhus. 

they  had  learned  before  rather 

than  trial  and  error. 
Among the other performance measures, no significant dif¬
ferences  were  found  in  either  the  number  of  verbal  assists  given 
by  the  instructor-pilot,  or  in  the  vertical  velocity  or  "g"  load¬ 
ing,  just  at  touchdown  for  unassisted  landings.  This  suggests 
that  no  apparent  biases  were  introduced  by  the  instructor-pilot. 


CONCLUSIONS 


There  are  several  directions  of  future  research  that  are 
envisioned  at  this  time. 

(1)  For  pilot  training  purposes,  tactual  commands  may 
benefit  a  student's  understanding  of  the  interactions  between 
aircraft  attitude,  speed  and  power  by  means  of  a  tight  control 
loop  and  the  display's  natural  stimulus-response  compatibility. 

(2) For more basic studies, cognitive, perceptual, and
motor  learning  may  be  investigated  by  employing  appropriate 
tactual  feedback  to  differentially  augment  visual  or  tactual 
display-control  relationships.  Such  studies  are  currently  being 
initiated  in  a  flight  simulator  with  a  computer  graphics  display 
of  the  landing  environment. 

(3)  For  practical  aircraft  control,  kinesthetic-tactual 
displays  incorporated  within  the  appropriate  control  levers 
could  allow  for  zero-zero  landings.  For  example,  during  an  in¬ 
strument  approach,  a  power  command  could  be  located  in  the 
throttle  lever  and  a  lineup  command  could  be  incorporated  in  a 
multiaxis  control  yoke.  Indeed,  the  tactual  pitch  commands 
currently  allow  an  ILS  approach  to  touchdown  if  the  cross-pointer 
indicator  is  utilized. 

In  summary,  the  exploration  of  the  efficacy  of  tactual  dis¬ 
plays  in  the  aircraft  environment  has  only  begun  and  many,  per¬ 
haps  significant,  improvements  may  be  made  in  both  the  safety 
and  precise  control  of  aircraft. 
SUMMARY

Compensatory tracking experiments in one and two axes using tactile
displays are described. The electro-tactile inputs to the operator are
located circumferentially about the waist region utilizing four electrodes.
The error signal, applied to the operator through the electrodes, is an
amplitude modulated 100 Hz pulse train with a fixed pulse width of 40 µsec.
The  dynamic  range  of  the  display  between  sensation  threshold  and  pain  is 
one  order  of  magnitude.  The  operator's  output  is  isometric  muscle  force 
applied  to  a  two  degree  of  freedom  joystick.  Describing  functions  for  both 
one  and  two-dimensional  tracking  with  low  frequency  pseudo-random  noise 
inputs  are  presented.  The  results  indicate  that  tactile  tracking  with  this 
particular  display  in  one-  and  two-dimensions  is  indeed  practical.  Coherence 
analysis  of  the  two-dimensional  tracking  task  indicates  that,  for  the  low 
frequency  forcing  function  utilized,  no  significant  linear  cross-coupling 
exists  in  the  operator  model.  Therefore,  a  two-dimensional  task  can  be 
considered  to  be  two  independent  one-dimensional  tasks.  A  comparison  of 
the  tracking  ability  of  traumatic  paraplegics  with  that  of  normal  individuals 
allows  the  conclusion  that  there  are  no  essential  differences  between  these 
groups. 


INTRODUCTION 

Paraplegics  with  spinal  cord  lesions  at  or  above  L3  have  a  great  deal 
of  difficulty  in  ambulating.  One  of  their  major  problems  is  the  lack  of 
motor  control  over  knee  and  hip  joints.  Poliomyelitis  paraplegics,  whose 
condition  is  due  to  a  virus  which  selectively  attacks  motor  fibers,  have 
their  sensory  ability  remaining  intact,  which  allows  them  to  perceive  the 
position  and  velocity  of  their  paralyzed  limbs.  This  perception  of 
position  and  motion  is  called  proprioception.  In  contrast,  traumatic 

*  Supported  in  part  by  the  Rehabilitation  Services  Administration,  DHEW 
under  grant  number  RD-23-P-55442 . 




.  ^  cn4n;1i  rn rd  lesion  due  to  a  traumatic  injury  usually 

The  poliomielit  P  P  j  cruCches,  whereas  the  traumatic  paraplegic 
long  leg  brace  ,  hair  The  lack  of  proprioception  in  the  legs 

C,Tr  ""S  T„1  difference  between3 these  U«-U.  Based  on  this 

s; 

eleCtOutaofythe1tSercomSnly  used  channels  of  communication,  «hich  are 
i  rit  t-nrv  and  tactile,  the  latter  was  chosen  to  be  the  mode  of 
visual,  audito  y  f  h  sensory  feedback.  The  visual  channel  is 
information  display  for  the  se  y  ^  f<jr  ^  in  obstacle 

utilized  admiration  of’scenery.  Although  the  auditory  channel 

avoidance  an  ik;  Cask  it  can  be  most  easily  jammed  or  over- 

is  less  uti  ize  in  sirens  etc.  For  both  visual  and  auditory 

displays' the^ problem  of  providing  a  colmetically  acceptable  mechanization 
P  '  h  a .♦ ff i rul t* v  For  the  tactile  channel,  especially  when  the 

r.;s“*rE.£ !s°is  «'.n  £.1-.  *.  *»«».  ^  *««««•«•  - 

«*““<=  arPt‘fbtil«unr!SlSy«Su“5li  been  reported;  their 

S  ,  L  “»  priced  in  Reference  1.  The  displays  for  thee. 

s=  h£t?.s  ^ 

ihedisolavs  were  multi-electrode  or  vibrator  type  arranged  in  +  or  H 
f^hionPon'the  chest  and  abdomen  of  the  pilot.  The  problems  with  all 

f  Hi  ,  are  that  they  do  not  provide  an  easy  mode  of  presenting 

bod^posture  information  to  the  subject  wearing  the  display. 

presented  also  were  not  analyzed  for  cross-coupling  components  and  do  not 

provide  a  model  of  the  human  tracking  an  input  whose  power  sPect^"’ 

^  Moc  static  posture  sway.  Humans  standing  erect  control  small 
oscillatory movements about the normal upright position; this deviation is
termed sway. It has been measured and its power spectral content has been
reported.

The basic objectives of this study are (1) to show the feasibility
of tracking a forcing function, having a power spectral density (PSD)
analogous to standing sway in humans, utilizing an electro-tactile display
of four electrodes located about the waist, (2) to model the human operator
tracking this forcing function in one- and two-dimensional compensatory
tasks, (3) to investigate the cross coupling terms in the human operator
model in order to determine their relative importance and (4) to compare the
tracking ability of normals to that of paraplegics.




EXPERIMENTAL DESIGN


The simple tactile compensatory tracking task considered in this
experiment is shown in Figure 1. The subject in the loop has the task of
minimizing the error displayed to him. A pseudo-random binary sequence,
appropriately filtered, provides the forcing function c to be
tracked. The subject's force response r, a function of his voluntary
muscle force transformed by the isometric joystick manipulator, is subtracted
from c to provide a measure of the tracking error e. The tactile display
signal generator produces negative constant current pulses at a frequency of
100 Hz with a duration of 40 µsec and an amplitude directly proportional
to the error. The tactile display consists of an elastic belt, four (4)
electrodes and the associated wiring shown in Figure 2. The placement of
the electrodes about the waist region is shown in Figure 3. Details of
the surface electrode can be seen in Figure 4. These electrodes are
constructed from silver according to the findings in Reference 4 and are
of the concentric type. The center circular region is the active area
supplying the negative current. It is isolated from the rectangular ground
plane by a small teflon spacer.

The dynamic range of the display is one order of magnitude between the
threshold of sensation and the threshold of pain. This range is indicated
in the strength duration curves presented in Figure 5. There the current
stimulus amplitude for sensation and pain threshold is plotted for a number
of pulse durations. The duration of 40 µsec was chosen since it provides
reasonable dynamic range without requiring unduly large voltages. The
electrodes are applied to the skin after a small amount of electrode gel
is applied. The electrodes in this configuration present a dynamic
impedance of approximately 2-4K ohms. The minimum level of stimulation
throughout the experiments was adjusted to the threshold of sensation in
order to eliminate the dead zone inherent in this display configuration.
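
The mapping from tracking error to stimulus amplitude can be sketched as follows. The pulse rate and width are taken from the text; the `gain` parameter, the clipping at the pain threshold, and the function names are assumptions of this sketch rather than details of the actual signal generator.

```python
PULSE_RATE_HZ = 100.0   # stimulation rate stated in the text
PULSE_WIDTH_S = 40e-6   # pulse duration stated in the text


def stimulus_amplitude(error, gain, sensation_threshold):
    """Pulse amplitude for a tracking error, in the units of the threshold.

    Zero error maps to the sensation threshold (no dead zone). The amplitude
    grows in proportion to |error| and is clipped at ten times the threshold,
    the one-order-of-magnitude range quoted for the display. The clipping
    rule and the gain parameter are assumptions of this sketch.
    """
    pain_threshold = 10.0 * sensation_threshold
    amplitude = sensation_threshold + gain * abs(error)
    return min(amplitude, pain_threshold)


def duty_cycle():
    """Fraction of time the stimulus current is on."""
    return PULSE_RATE_HZ * PULSE_WIDTH_S
```

With these pulse parameters the current is on for about 0.4 percent of the time, whatever the amplitude.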

The two-dimensional tracking system is shown in Figure 6, in block
diagram form. The pseudo-random noise generator is a 17 stage digital
shift-register with a modulo-2 adder in the feedback path. Design details
may be found in Reference 5. An original sequence S and a delayed
version of S can be generated through utilization of additional modulo-2
adders. For this particular design, utilizing a clock rate of 30 Hz, the
duration of the sequence is 72.8 minutes. The delayed sequence is shifted
by 36.4 minutes, providing two independent signals. Each of these sequences
is filtered by identical filters labeled A/P (anterior/posterior) and Lat.
(lateral) sway filter. The output of each summing junction, representing
the tracking errors e, is amplified, separated into positive and negative
components, synchronously sampled, converted to stimulus current and finally
applied to the subject via the electrodes. The tactile display gain (TDG)
was set at five for all experiments presented and is included in the models.
This gain setting minimizes the tracking error; that is, lower and higher
gains tend to increase the tracking error score as measured by the performance
monitor (PM). The performance monitor calculates the integral of the
absolute value of the tracking error, which is displayed on a meter for
on-line performance evaluation. The isometric force response measured by

FIGURE 1. Tactile Compensatory Tracking Task
FIGURE 3. Electrode Placement
FIGURE 5. Strength Duration Curve Indicating Dynamic
Range of Display
FIGURE 6. Tracking System Block Diagram

the two degree of freedom manipulator (joystick) is fed back to the
summing  junction. 
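
The quoted sequence duration can be checked with a small shift-register sketch. A 17-stage register with modulo-2 feedback can produce a maximal-length sequence of 2^17 - 1 = 131071 states; the feedback taps used here (stages 17 and 14) are an assumption, since the actual design is deferred to Reference 5.

```python
N_STAGES = 17
MASK = (1 << N_STAGES) - 1


def lfsr_step(state):
    """Advance a 17-stage shift register by one clock.

    Feedback is the modulo-2 sum (XOR) of stages 17 and 14; this tap choice
    is an assumption, not taken from the paper.
    """
    feedback = ((state >> 16) ^ (state >> 13)) & 1
    return ((state << 1) | feedback) & MASK


def sequence_period(seed=1):
    """Count clocks until the register returns to its starting state."""
    state = lfsr_step(seed)
    count = 1
    while state != seed:
        state = lfsr_step(state)
        count += 1
    return count


def sequence_minutes(clock_hz=30.0):
    """Duration of one maximal-length sequence at the given clock rate."""
    return (2 ** N_STAGES - 1) / clock_hz / 60.0
```

At a 30 Hz clock, 131071 states last about 72.8 minutes, and a half-sequence shift corresponds to about 36.4 minutes, in agreement with the figures in the text.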

The  physical  configuration  of  the  tracking  system  hardware  is  shown 
in  Figure  7  and  contains  all  the  elements  described  above. 

One paraplegic and three normal subjects have been tested in two
one-dimensional and one two-dimensional compensatory tracking tasks. The
one-dimensional tasks are (1) tracking in the anterior/posterior plane and
(2) tracking in the lateral plane. The two-dimensional task is the
combination of these individual tracking tasks.

For each experiment the forcing functions (c1 and c2), the tracking
errors (e1 and e2), the force responses (r1 and r2), and a 100 Hz trigger
signal synchronous with the stimulation rate are recorded on analog tape.
The  analog  data  is  later  digitized  at  the  100  Hz  rate  and  finally  analyzed 
at  a  10  Hz  rate.  Proper  precautions  are  taken  to  reduce  aliasing  due  to 
sampling  by  appropriately  filtering  the  data  at  each  step  of  the  operation. 
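
One simple way to go from the 100 Hz digitized record to a 10 Hz analysis rate while limiting aliasing is to average each block of ten samples. This is only a sketch of the idea: the text does not specify the filters that were actually used, and the function name is hypothetical.

```python
def decimate_by_block_average(samples, factor=10):
    """Reduce the sample rate by `factor`, averaging each block of samples.

    Averaging acts as a crude low-pass (anti-aliasing) filter before the
    rate reduction; it stands in for the unspecified filters of the study.
    """
    n_blocks = len(samples) // factor
    return [sum(samples[i * factor:(i + 1) * factor]) / factor
            for i in range(n_blocks)]
```

A sharper low-pass filter would normally be preferred; the block average is used here only because it keeps the example short.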

The time history of the recorded signals for a typical tracking run
is shown in Figure 8. The similarity between the forcing function and
the force response is obvious. The tracking error is plotted on an
expanded scale.


ANALYSIS 


Quasilinear  describing  functions,  based  on  spectral  analysis,  are 
utilized  to  represent  the  human  operator  model  in  this  tactile  compensatory 
tracking  task.  The  model  for  the  human  operator  in  a  single  axis  task  is 
shown  in  Figure  9.  Models  of  this  form  have  been  extensively  discussed  in 
the literature on human operators, such as Reference 6. The open loop
describing function for the human operator (Reference 6) is found to be

$$Y_H(j\omega) = \frac{S_{cr}(j\omega)}{S_{ce}(j\omega)} \tag{1}$$

where $S_{cr}(j\omega)$ and $S_{ce}(j\omega)$ are the input-output and input-error
cross-power spectral densities.

Estimates  of  the  cross-power  spectral  densities  are  obtained  by  a 
method  of  averaging  modified  periodograms  as  described  in  Reference  7.  In 
this  procedure  the  time  domain  data  is  sectioned  into  M  overlapping  segments 
of  length  N.  The  overlap  here  has  been  chosen  to  be  N/2  in  order  to 
minimize  the  variance  of  the  spectral  estimate  for  the  total  available 
data length. Each data section is tapered in some fashion; the Hamming
window was chosen to perform this function. The Fourier transform, defined
as

$$X_k = \sum_{n=0}^{N-1} x_n w_n e^{-j 2\pi n k / N}, \qquad k = 0, 1, \ldots, N-1 \tag{2}$$

is calculated for each windowed section utilizing a standard FFT algorithm.
The modified periodogram

$$I_m(\omega_k) = \frac{\Delta t}{N U} \left| X_k \right|^2, \qquad m = 1, 2, \ldots, M \tag{3}$$

is formed, where U is a function of the total energy in the window and $\Delta t$ is
the time between samples. The estimate of the spectral density is then
obtained by averaging the modified periodograms according to the equation

$$\hat{S}_{xx}(\omega_k) = \frac{1}{M} \sum_{m=1}^{M} I_m(\omega_k) \tag{4}$$


Cross-power  spectral  density  estimates  are  obtained  by  following  the  same 
procedure,  except  in  this  case  the  modified  periodogram  is  defined  as 


$$I_m(\omega_k) = \frac{\Delta t}{N U} \left[ X_k^{*} Y_k \right] \tag{5}$$


where  X  and  Y  are  the  Fourier  transforms  of  the  two  signals  under  considera¬ 
tion  and  the  asterisk  indicates  the  complex  conjugate. 
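
The averaging procedure and the describing-function ratio can be sketched in plain Python. This is an illustrative sketch only: a direct DFT stands in for the FFT, the window energy term U is taken as the mean squared window value (Welch's normalization, an assumption here), and all function names are hypothetical.

```python
import cmath
import math


def hamming(n_points):
    """Symmetric Hamming window of length n_points."""
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * n / (n_points - 1))
            for n in range(n_points)]


def dft(x):
    """Direct DFT; an FFT would be used in practice."""
    n_points = len(x)
    return [sum(x[n] * cmath.exp(-2j * math.pi * n * k / n_points)
                for n in range(n_points))
            for k in range(n_points)]


def segment_starts(total_len, seg_len):
    """Start indices of segments overlapped by half their length."""
    return list(range(0, total_len - seg_len + 1, seg_len // 2))


def cross_spectrum(x, y, seg_len, dt):
    """Average of modified cross-periodograms of x and y (x = y gives a PSD)."""
    w = hamming(seg_len)
    u = sum(v * v for v in w) / seg_len  # window energy term U (assumed form)
    starts = segment_starts(len(x), seg_len)
    total = [0j] * seg_len
    for s in starts:
        xk = dft([w[n] * x[s + n] for n in range(seg_len)])
        yk = dft([w[n] * y[s + n] for n in range(seg_len)])
        for k in range(seg_len):
            total[k] += dt / (seg_len * u) * xk[k].conjugate() * yk[k]
    return [t / len(starts) for t in total]


def describing_function(c, r, e, seg_len, dt):
    """Y_H at each frequency bin as the ratio S_cr / S_ce."""
    s_cr = cross_spectrum(c, r, seg_len, dt)
    s_ce = cross_spectrum(c, e, seg_len, dt)
    return [a / b for a, b in zip(s_cr, s_ce)]
```

As a sanity check, an operator that is a pure gain of 3 in a unity-feedback loop gives r = 0.75 c and e = 0.25 c, and the ratio returns 3 at every bin. Ten half-overlapped sections of 1024 points span 1024 + 9 x 512 = 5632 samples if they tile the record exactly.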

For this particular analysis, ten sections of 1024 points overlapped
by 512 points were used. Under these conditions it can be shown (Reference 7)
that the variance of the spectral estimate is given by

$$\operatorname{Var}\left[\hat{S}_{xx}(\omega_k)\right] = \frac{S_{xx}^2(\omega_k)}{9.104} \tag{6}$$


Utilizing this technique to estimate the PSD of the forcing function
gives the results presented in Figure 10. The time domain signal c1 is
shown in the top portion of the figure and the estimate of its PSD (GC1C1)
is shown in the lower portion of the figure.

In  order  to  investigate  the  cross  coupling  in  the  human  operator  for 
the  two  dimensional  tracking  task,  the  closed  loop  cross-coupled  model 
shown  in  Figure  11  is  utilized.  Ordinary,  partial  and  multiple  coherence 
functions  can  be  calculated  as  described  in  Reference  7.  For  notational 
convenience  the  frequency  dependence  of  all  terms  in  the  following 
equations has been dropped. The ordinary coherence between input $c_1$ and
output $r_1$ is given by


$$\rho^2_{c_1 r_1} = \frac{\left| S_{c_1 r_1} \right|^2}{S_{c_1 c_1} S_{r_1 r_1}} \qquad (7)$$

This function describes the portion of the power contained in the output
$r_1$ which is attributable to $c_1$ passed through a linear system. The
ordinary coherence for the second axis, $\rho^2_{c_2 r_2}$, is obtained by
changing all the subscripts from one (1) to two (2).
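
A minimal sketch of equation (7) at a single frequency follows. It assumes the spectra are formed by averaging over per-section Fourier coefficients, as in the preceding procedure; the coefficient values and the system `H` are hypothetical, and the common scale factor is dropped because it cancels in the ratio.

```python
def mean(values):
    return sum(values) / len(values)


def spectra(X, Y):
    """Averaged auto- and cross-spectra at one frequency from per-section
    Fourier coefficients X[m], Y[m]."""
    S_xx = mean([abs(a) ** 2 for a in X])
    S_yy = mean([abs(b) ** 2 for b in Y])
    S_xy = mean([a.conjugate() * b for a, b in zip(X, Y)])
    return S_xx, S_yy, S_xy


def ordinary_coherence(X, Y):
    # Equation (7): |S_xy|^2 / (S_xx * S_yy).
    S_xx, S_yy, S_xy = spectra(X, Y)
    return abs(S_xy) ** 2 / (S_xx * S_yy)


c1 = [1 + 0j, 1j, -1 + 0j, -1j]          # hypothetical input coefficients
H = 0.5 - 0.3j                           # hypothetical linear system
r_linear = [H * c for c in c1]           # output fully explained by c1
r_unrelated = [1 + 0j, 1 + 0j, 1 + 0j, 1 + 0j]  # orthogonal to c1 on average

coh_linear = ordinary_coherence(c1, r_linear)
coh_unrelated = ordinary_coherence(c1, r_unrelated)
print(coh_linear, coh_unrelated)
```

An output that is an exact linear function of the input gives a coherence of one (up to rounding), while an output that is uncorrelated with the input over the sections gives zero.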

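The partial coherence between input $c_2$ and output $r_1$ is given by


$$\rho^2_{c_2 r_1 \cdot c_1} = \frac{\left| S_{c_1 c_1} S_{c_2 r_1} - S^*_{c_1 c_2} S_{c_1 r_1} \right|^2}{S^2_{c_1 c_1} S_{c_2 c_2} S_{r_1 r_1} \left[ 1 - \rho^2_{c_1 c_2} \right] \left[ 1 - \rho^2_{c_1 r_1} \right]} \qquad (8)$$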


This function describes the portion of the power contained in the output
$r_1$ which is attributable to $c_2$ passed through a linear system, with
all effects of $c_1$ removed.

The  multiple  coherence  function 


$$\rho^2_{r_1 \cdot c_1 c_2} = 1 - \left[ 1 - \rho^2_{c_1 r_1} \right] \left[ 1 - \rho^2_{c_2 r_1 \cdot c_1} \right] \qquad (9)$$


describes  the  portion  of  the  output  power  attributable  to  both  inputs  via 
linear  systems.  For  equations  8  and  9,  the  coherence  functions  for  the 
other  input  can  be  determined,  as  it  was  for  equation  7,  by  interchanging 
the  numerical  subscripts. 
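
The three coherence functions can be sketched together at one frequency as below. This is an illustrative reading of equations (7) through (9), not the authors' code; the function names and the sample coefficients are hypothetical, and the spectra are unscaled averages over sections.

```python
def mean(values):
    return sum(values) / len(values)


def cross(X, Y):
    # Averaged cross-spectrum, mean of conj(X) * Y, as in equation (5).
    return mean([a.conjugate() * b for a, b in zip(X, Y)])


def coherences(c1, c2, r1):
    """Ordinary, partial, and multiple coherence for output r1."""
    S11 = cross(c1, c1).real
    S22 = cross(c2, c2).real
    Srr = cross(r1, r1).real
    S12 = cross(c1, c2)
    S1r = cross(c1, r1)
    S2r = cross(c2, r1)
    ordinary = abs(S1r) ** 2 / (S11 * Srr)                # equation (7)
    input_coh = abs(S12) ** 2 / (S11 * S22)               # coherence of c1, c2
    num = abs(S11 * S2r - S12.conjugate() * S1r) ** 2
    den = S11 ** 2 * S22 * Srr * (1 - input_coh) * (1 - ordinary)
    partial = num / den                                   # equation (8)
    multiple = 1 - (1 - ordinary) * (1 - partial)         # equation (9)
    return ordinary, partial, multiple


c1 = [1 + 0j, 1 + 0j, 1 + 0j, 1 + 0j]      # hypothetical input 1
c2 = [1 + 0j, -1 + 0j, 1 + 0j, -1 + 0j]    # hypothetical input 2
noise = [1 + 0j, 1 + 0j, -1 + 0j, -1 + 0j]  # unrelated to both inputs

# Output driven by c1 plus unexplained noise, with no contribution from c2.
uncoupled = coherences(c1, c2, [2 * a + n for a, n in zip(c1, noise)])
# Output driven by both inputs through linear gains, with no noise.
coupled = coherences(c1, c2, [2 * a + b for a, b in zip(c1, c2)])
print(uncoupled, coupled)
```

In the uncoupled case the partial coherence is zero and the multiple coherence equals the ordinary coherence. The formula is undefined when the ordinary coherence, or the coherence between the two inputs, is exactly one, so real data would need a guard for that case.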

The  remnant,  that  is  the  remaining  output  power  which  is  not  explained 
by  the  linear  system,  may  be  found  from 


$$G_{nn} = \left( 1 - \rho^2 \right) G_{rr} \qquad (10)$$


where $G_{nn}$ is the remnant power. When the coherence ($\rho^2$) is large, that is
as  it  approaches  1,  the  remnant  power  can  be  seen  to  approach  0. 
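
A short sketch of equation (10), applied frequency by frequency, is given below. The name `G_nn` for the remnant power and all numerical values are assumptions chosen for illustration only.

```python
def remnant_power(coherence, G_rr):
    """Equation (10): remnant power at each frequency."""
    return [(1.0 - rho2) * g for rho2, g in zip(coherence, G_rr)]


coherence = [0.95, 0.80, 0.40]   # hypothetical coherence values
G_rr = [2.0, 5.0, 1.0]           # hypothetical output PSD values
G_nn = remnant_power(coherence, G_rr)
print(G_nn)
```

As the coherence at a frequency approaches one, the corresponding remnant term approaches zero regardless of the output power.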


RESULTS 


The  results  of  a  typical  anterior/posterior  tracking  experiment  are 
shown  in  Figure  12.  The  magnitude  of  the  describing  function  YH1  is 
given  in  the  top  left  of  the  figure.  The  phase  of  YH1  is  given  in  the 
lower  left  of  the  figure  and  the  coherence  function  for  this  task  is 
presented  at  the  right.  The  vertical  bars  indicate  the  95  percent  confidence 
intervals  for  the  estimates  of  magnitude  and  phase.  The  phantom  curve  is 
a  visual  fit  to  the  estimated  magnitude  and  phase.  The  equation  of  this 
model  is 

The coherence function indicates that a linear model describes most of the
output power to a frequency just beyond 1 radian per second for this
particular task. The main difference between this model and those obtained
from visual tracking tasks is the delay time. For tactile tracking, delay
times of 250 to 300 msec have been found, whereas delay times of 100 to
150 msec are reported for visual tracking.

The  results  of  the  coherence  function  analysis  for  the  two-dimensional 
task  are  shown  in  Figures  13  and  14.  The  results  for  the  Anterior/Posterior 
(A/P)  output  (Figure  13)  indicate  that  the  ordinary  and  multiple  coherence 
functions are almost identical. This is due to the extremely low coherence
of the A/P output with the Lateral input as shown in the partial coherence
function on the right side of the figure. This lack of coherence leads to
the conclusion that there is no linear system which can explain the A/P
output power from the Lateral input. Figure 14 presents the same results
for the Lateral output. Again the contribution of the cross-coupling term
in explaining the Lateral output is not significant. This analysis shows
that for the task considered here, the subject is capable of uncoupling
the two-dimensional task into two one-dimensional tasks. Therefore, it
suffices to model the subject in a two-dimensional tracking task by using
two one-dimensional models as shown in Figure 9, represented by an equation
of the form given in equation 11.

The  parameters  of  equation  11  will  vary  as  a  function  of  task 
dimensionality.  This  can  be  seen  by  comparing  the  open  loop  describing 
function  magnitude  for  one-  and  two-dimensional  tactile  tracking  (see 
Figure 15). The describing function bandwidth is reduced when the
complexity of the task is increased. In addition to that change, the
time delay associated with two-dimensional tactile tracking is approximately
twice that of one-dimensional tactile tracking, or 500 msec.

Tracking tests accomplished with an L3 paraplegic indicate that his
ability to track the signal is as good as that of normals.


CONCLUSIONS 


The feasibility of tracking with a tactile display using electrocutaneous
stimulation at the waist has been demonstrated in one and two
dimensions. Quasilinear models have been fitted to the data. The
cross-coupling terms in the model are negligible; hence, for the range of
frequencies used as forcing functions in this study, the subjects are capable of
decoupling a two-axis tracking task into two independent single-axis tasks.
Paraplegics seem to be able to track with this type of display as efficiently
as normal individuals.


A SIMULATOR STUDY OF HELICOPTER PILOT WORKLOAD
REDUCTION USING A TACTILE DISPLAY

An  informal  report  presented  to  the 
Twelfth  Annual  Conference  on  Manual  Control 

By  Richard  S.  Dunn,  Richard  D.  Gilson,  Pershing  Sun 


Mission  performance  in  current  tactical  Army  helicopter  operations  is 
significantly  limited  by  high  visual  workload.  Aircrew  tasks  in  low  level 
or terrain flight demand continuous outside visual reference to fly, navigate,
and perform mission functions. In nap-of-the-earth or contour flight,
the  airspace  is  a  ground-defined  environment  in  which  operators  strive  for 
masking  cover,  speed  and  accurate  navigation.  This  kind  of  visual  contact 
flight  leaves  no  time  for  scanning  display  devices  in  the  cockpit.  These 
considerations  have  generated  interest  in  a  simple  tactile  display  concept 
which  may  form  the  basis  of  an  effective  information  display  strategy  for 
Army  helicopter  cockpits. 

This is a brief report of a limited simulation experiment intended
to test the feasibility of a tactile display. It had two objectives: to
determine whether the adaptive measurement procedure developed by Dr. Pershing
Sun at the Avionics Laboratory (Reference 1) could be used to evaluate
novel display devices in tactical operations; and to determine whether a
tactile display could provide useful reductions in visual workload in
tactical helicopter operations.


The laboratory prototype single channel tactile display used for these
tests has been described in previous reports to this conference (Reference 2)
and is shown in figure 1.


Figure 1. Tactile Display Prototype

The piston moves in both directions to cue a 2-way tracking control movement.
It is a compensatory display: in use, the operator tracks the output to ob[...].
[...] on the collective lever to provide altitude or power commands as shown
in figure 2.


Figure 2. TASS Cockpit with Tactile Display on Collective Control


Two separate [...] with a deceleration phase in the last minute [...] for this
task represented a [...]. Tactile cueing was compared with the normal gl[...]
a four cue electromechanical flight director. [...] scoring procedure was
employed as the dependent variable. Dr. Sun has described this procedure in
detail at previous meetings. Briefly, [...] the vehicle [...] slowly degraded
by the introduction [...] in the cyclic roll control. The amount of time delay
[...] primary task performance measurement circuit indicates that the equally
weighted sum of errors in vertical, lateral, [...] and heading (the four
commands of the flight director) are at a [...] value. This procedure has the
disadvantage of altering the handling qualities, but the advantage that every
subject gives the s[...] chance. He flies just at the acceptable error
criterion. With adaptive measurement, primary task performance alters primary
[...] reached. The amount of time delay p[...] after reaching criterion is
taken as [...] indicator o[...] for [...] combined display-control-task
combination.

After practice, five qualified helicopter pilots each made five ILS approaches
with [...] visual cue on the flight director and five with the tactile display.
Typical results are shown in figure 3.

Figure 3. Tracking Command and Time Delay Results


The two tracings on top show the glide slope error (or command) signal:
first visual tracking, and below it tactile tracking. Time delay achieved
is taken after the build up period, the first 90 seconds, has elapsed. All
subjects on all trials showed a greater time delay when using the tactile
display. Over all subjects and trials, the increase in time delay using
the tactile display averaged [...]. This, along with the higher frequency of
response to altitude seen in the tracings for the collective command, supports
the hypothesis that visual workload relief is present and is effective at
enhancing system performance.

But visual scanning in a flight director is not clearly an example
of the kind of visual workload present in tactical helicopter operations.
To get closer to the operational environment of interest, a second, basically
different, simulation was made.

For this task a visual scene was employed, generated by a 300 to 1
scale moving-belt terrain model. The display was a small black and white
TV without collimated viewing. The task required hovering over a point at
an assigned heading. Changes in altitude were commanded to simulate a
tactical bob-up maneuver. The helicopter math model employed represented
an OH-6 scout helicopter.


Command  information  shown  in  Figure  4  was  superimposed  over  the 
video  display.  It  was  the  Integrated  Trajectory  Error  Display  operating 
in the hover display mode (Reference 3). It provides both attitude and
position cues which can be employed for hover position tasks. The cross
is a movable indicator of desired ground position. The circle is a fixed
ownship  position  indicator.  The  line  displays  velocity,  both  magnitude 
and  direction,  while  the  small  bug  is  an  acceleration  cue.  For  these  tests, 
the  left  hand  vertical  band  displayed  absolute  altitude  with  a  collective 
command  on  the  left  inner  scale.  The  heading  command  was  provided  by  the 
indicator  at  the  center  bottom  of  the  display. 


Figure 4. Integrated Trajectory Error Display - Command Information


The task required hovering in position while holding an assigned
heading and altitude. After one minute a maximum performance climb was
required  to  a  new  altitude  with  subsequent  holding  at  the  new  level.  This 
was  followed  by  descent  to  the  original  altitude.  The  change  was  from 
25  feet  to  125  feet  and  back.  Auditory  warning  signals  were  presented  to 
alert  the  pilot  to  upcoming  altitude  changes. 

To  estimate  the  effect  of  the  tactile  display  on  visual  workload  a 
visual  secondary  task  was  employed  as  the  dependent  variable.  As  before, 
the  adaptive  circuits  computed  primary  task  performance  and  provided  a 
weighted  sum  of  error  scores.  But  the  output  was  not  used  to  alter  the 
vehicle  handling  qualities  and  no  time  delays  were  applied.  Instead,  the 
adaptive measurement procedure was used to alter the difficulty of a
secondary task by turning the task on and off.
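
The on/off gating described above can be sketched as follows. This is only an illustration under stated assumptions: the error channels, weights, criterion value, and function names are hypothetical, and the sketch assumes the secondary task is switched on while the weighted primary-task score is within the criterion and switched off otherwise.

```python
def weighted_error(errors, weights):
    # Weighted sum of absolute error scores from the primary task.
    return sum(w * abs(e) for e, w in zip(errors, weights))


def secondary_task_schedule(error_samples, weights, criterion):
    """Return one on/off flag per sample: the secondary task is on only
    while the primary-task score is within the criterion."""
    return [weighted_error(e, weights) <= criterion for e in error_samples]


# Hypothetical samples of (vertical, lateral, heading) error.
samples = [(0.2, 0.1, 0.0), (1.5, 0.4, 0.3), (0.3, 0.2, 0.1), (0.1, 0.0, 0.1)]
weights = (1.0, 1.0, 1.0)
flags = secondary_task_schedule(samples, weights, criterion=1.0)
fraction_on = sum(flags) / len(flags)
print(flags, fraction_on)
```

Under this scheme the fraction of time the secondary task is on, together with the number of secondary responses made, serves as the measure of spare visual capacity.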

The  task  involved  cancelling  lights  and  was  similar  to  a  secondary 
visual task employed by Kelley and Wargo (Reference 4). Two lamps were
mounted  in  the  peripheral  field  of  view  so  that  it  was  clear  when  one 
came  on,  but  a  shift  in  gaze  away  from  the  main  display  was  required  to 
determine  which  one.  Lights  were  illuminated  randomly  one  half  second 
after the previous response. Cancelling required operation of the appropriate
left or right rocker switch on the cyclic grip. These were standard Army
intercom or [...] talk switches. When the primary task scoring circuits [...]
performance within the error criteria, the secondary task was activated. When
performance exceeded the criteria, the secondary task was deleted. Thus the
situation represented cross-adaptive control [...] difficulty by measurement
of primary [...]. [...] expected, the basic flight task proved to [...]
motion cues.

Only three subjects [...] able to perform the [...] of control. It is important
to note, however, that the cross-adaptive procedure resulted in stable sys[...]
error criteria for all these subjects. Each [...] six times with the visual
cue and six times with the tactile display.

Here the results also favored the [...]. [...] secondary task trials with the
tactile cue resu[...] about the same amount of tim[...]ly longer, but with more
than 15 [...] responses when using the tactile display. This supports the
hypothesis of equal system performance with an increase in visual workload
reserve.

Unfortunately, scheduling for both [...] facility prevented test[...] solidify
these findings. They must therefore be classed [...] and only suggestive. But
they are encouraging and support the contention that the tactile display [...]
reductions in helicopter pilot visual workload [and] that the workload
reductions can free visual attention for other productive tasks.

This research was jointly sponsored by [...].